Gene Daro_0920 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_0920 
Symbol 
ID3570077 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp996174 
End bp997322 
Gene Length1149 bp 
Protein Length382 aa 
Translation table11 
GC content56% 
IMG OID637679378 
Productextracellular ligand-binding receptor 
Protein accessionYP_284146 
Protein GI71906559 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones61 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACCAC TACATAAGCT ACTACTCGCC AGCATATTGG CGCTGGGCAT GACTCAAACA 
CATGCGGAAG TCGGGGTTAC CGACACCAGT ATCACGCTGG GCATGTCCGC CCCCTTCACC
GGGCCCAACG GGCTGTACGG CATGCAGATG CGGGAAGCCA TCACGGCTCA TTTCGATCAG
CTCAATAAAA GCGGCGGCAT CAACGGGCGA AAACTCGAAC TGATCACGAT CGATGATGGC
TACGAAACCG ATCGCACCCT GGCCAACACC AAAACCCTGA TTCAGGACAA GCAAGTCTTC
GCGCTGATGG GTTATTACGG ATCCACACCG ACGACCGAAG CCATGAACAA GGTCTTCGGT
CCCGCCAAAG TGCCGCTCAT TGGCACAATT TCGGGGGCTG GAACGCTCCG TGAACCCCTA
GCAAGCAACC CAAACAGTCG CTACATGTTC AACATTCGCG CCAGCTATGC CGACGAGGCA
GAAGCGATCG TGAACCAGAT CATCGCGCTG GGCCTGAAAA ACATTGCGGT GTTCTACCAG
AACGACGGAT TCGGAAAATC CGGCCTGGAA GGCGTCACCA ACGCGCTCAA GCGGGCCAAC
CTCGCTCCGG TCGCCGTTGG AACAGTCGAA CGAAACTCTC TTGACGTTGC CAAAGCGGCC
GAAGCAATCA GCAAGACCAA TCCGCAAGCC GTCGTCATGG TCACGCTATA CAAGCCAACT
GCCGCATTCG TCAAAGCCAT GAAGCAACTC GGACAGTTTC CGATGTTCCT GACCCTCTCA
CCGGTCGGTG GCGAGGTCCT AGCGCAAGAG CTGGGCAATG ATGCGCGCGG GATCGGAATT
TCGCAGGTTG TTCCCTACCC CTGGAACGAC ACCATTCCCA TCGTCAAGGA CTATCAGCGC
CTTCTGGACA AGCAAAAGGA CAAGTTCTCC TACTACGGCC TCGAAGGCTA CATCACGGCC
CGTCTGGTGG CAGAAGCACT CAAGAAGGCA GGCAAGGATC TGACCCGCGA AAAACTAGTG
ACCACGCTGG AAGGCATGCA GAACTTTGAC CTAGGGGGTT TCAAGCTCAA CTACAGCCCC
AACAGCCGGC AGGGATCTCG CTATGTCGAA TTGACCGTGG TGGGCGCTGG CGGCAAGGTA
ATCAAATAA
 
Protein sequence
MKPLHKLLLA SILALGMTQT HAEVGVTDTS ITLGMSAPFT GPNGLYGMQM REAITAHFDQ 
LNKSGGINGR KLELITIDDG YETDRTLANT KTLIQDKQVF ALMGYYGSTP TTEAMNKVFG
PAKVPLIGTI SGAGTLREPL ASNPNSRYMF NIRASYADEA EAIVNQIIAL GLKNIAVFYQ
NDGFGKSGLE GVTNALKRAN LAPVAVGTVE RNSLDVAKAA EAISKTNPQA VVMVTLYKPT
AAFVKAMKQL GQFPMFLTLS PVGGEVLAQE LGNDARGIGI SQVVPYPWND TIPIVKDYQR
LLDKQKDKFS YYGLEGYITA RLVAEALKKA GKDLTREKLV TTLEGMQNFD LGGFKLNYSP
NSRQGSRYVE LTVVGAGGKV IK