Gene Apar_0496 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApar_0496 
Symbol 
ID8413345 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAtopobium parvulum DSM 20469 
KingdomBacteria 
Replicon accessionNC_013203 
Strand
Start bp565311 
End bp566720 
Gene Length1410 bp 
Protein Length469 aa 
Translation table11 
GC content43% 
IMG OID645022064 
Productradical SAM family protein 
Protein accessionYP_003179518 
Protein GI257784301 
COG category[R] General function prediction only 
COG ID[COG4277] Predicted DNA-binding protein with the Helix-hairpin-helix motif 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATACTC TTGATAAACT CACCATACTT GCAGATGCGG CCAAATTTGA CGCGGCATGT 
ACCTCCTCTG GTGTAGACAG AGATCCGCAG GCAGGAAAGA TAGGCATGGC TTCGACGGCG
GGTTGTTGCC ATTCATTTAC TCCTGACGGC CGCTGCATTA CGCTGCTAAA AGTTTTGCTT
TCTAATGCTT GTTGCTATGA CTGCGCTTAC TGCGTCAATA GATCGTCTGC TCAAACTAAA
CGTGCTACCT TTACACCACA AGAGTTAGCA GAACTTACCG TAGATTTCTA TAAAAGAAAC
TACATAGAAG GACTTTTTCT TTCCTCAGGT GTTATAGGGT CGCCGGATCA CACCACGGAA
CTCATGATTG AATGTCTCTC CTACCTTAGA AATGAACTGC TATTCAACGG CTATATTCAT
GCGAAGGTAA TTCCCGGAAC AGACTCAGAT CTTATTGACG CTATTGGCCG TTTGGCAGAC
AGACTCTCCG TCAACCTGGA ATTACCTAGC TCAACCTCAC TTACTAAACT TTGCCCGCAC
AAAAATGCTG AAACGGTTAC TAAACCCATG TCTTTTATTC ACCACCTACG CGTCTCTGAA
GAAAGACAAT TACTCGAAGC TAAAAATACG TCAAAGAGTA TTGTGAAGTC TGCTGGCAAA
TCTAAAGGTA TTATTATTCC AAGTCAGAAG CAGATTATAA AAGAAGGTTT AGCCTTACGT
TCTAATTGTT TGAAGAACAC TTTTATGCAT TCATCAAGGG TGTCTTCGAG TAAAAGGTCT
TTCTCGCCTG CTGGGCAGTC AACACAAATT ATTATTGGGG CTTCTCCAGA GTCAGATAAT
CAAATATTGC ATTTATCACA AGCGCTTTAC CAACAATTTG AGTTAAAACG AGTATTTTTT
TCTGCCTATA TTCCTGTTAT GGAAGATGAT CGTCTTCCTG AGTTAGATAC CCCTGTTCCC
CTCCGTAGAG AGCATCGCCT TTACCAAGCT GATTGGCTTA TGCGCTACTA TGCATTTTCT
CCTGATGAAC TTGTTTCTCC AGAAGCGCCT TGGCTTGATT TAGAAGTTGA CCCAAAGCTT
GGATGGGCAT TGGCTCATCT TGATCAATTT CCTATAGAAA TTATGGATGC ACCACTAGAA
ATACTTTTAC GAGTGCCCGG AATTGGCCCT ACCGGTGCAC GAAGAATCAT TGCCGCAAGA
AGACGATCTC GTCTTACTTT TGATGATCTT AAACGGCTGG GAATACAGCT CAAACGATCC
CATCACTTTC TCACCTGCTG CGGAGTAAGA GATGCTACAG CCCCACTTGA CCAAGAACTA
ATTAAACAGA GAGTAATTGC TGATGCTAAA GCAAGTTCAT ACAACAAAAC AAGACGTCGC
ATTGAATCCT CTCAACTCAG ACTCTTCTAA
 
Protein sequence
MDTLDKLTIL ADAAKFDAAC TSSGVDRDPQ AGKIGMASTA GCCHSFTPDG RCITLLKVLL 
SNACCYDCAY CVNRSSAQTK RATFTPQELA ELTVDFYKRN YIEGLFLSSG VIGSPDHTTE
LMIECLSYLR NELLFNGYIH AKVIPGTDSD LIDAIGRLAD RLSVNLELPS STSLTKLCPH
KNAETVTKPM SFIHHLRVSE ERQLLEAKNT SKSIVKSAGK SKGIIIPSQK QIIKEGLALR
SNCLKNTFMH SSRVSSSKRS FSPAGQSTQI IIGASPESDN QILHLSQALY QQFELKRVFF
SAYIPVMEDD RLPELDTPVP LRREHRLYQA DWLMRYYAFS PDELVSPEAP WLDLEVDPKL
GWALAHLDQF PIEIMDAPLE ILLRVPGIGP TGARRIIAAR RRSRLTFDDL KRLGIQLKRS
HHFLTCCGVR DATAPLDQEL IKQRVIADAK ASSYNKTRRR IESSQLRLF