Gene Daro_1056 details

Gene Information

Locus tag: Daro_1056
Symbol:
ID: 3568122
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dechloromonas aromatica RCB
Kingdom: Bacteria
Replicon accession: NC_007298
Strand:
Start bp: 1159506
End bp: 1160519
Gene Length: 1014 bp
Protein Length: 337 aa
Translation table: 11
GC content: 62%
IMG OID: 637679518
Product: extracellular solute-binding protein
Protein accession: YP_284282
Protein GI: 71906695
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1840] ABC-type Fe3+ transport system, periplasmic component
TIGRFAM ID:
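
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(1159506, 1160519)
print(length)                     # 1014
print(protein_length_aa(length))  # 337
```

Both results match the Gene Length and Protein Length fields listed in this record.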


Plasmid Coverage information

Num covering plasmid clones: 53
Plasmid unclonability p-value: 0.835874
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAAAGC AATTAGCCCT GGTCGCGCTG GCTGTGCTGG CGAGTTTCGC AACGCAGGCG 
CAGGAGAAAG TCCTCAACCT GTATTCCGCC CGCCACTACC AGACGGACGA GAAGCTCTAC
GACAACTTCA CCAAGCAGAC CGGTATCAAG ATCAACCGAA TCGATGGCAA GGAAGACGAG
CTGATGGAGC GCATTCGCAA CGAAGGCGCC AACAGCCCGG CCGACGTGTT CATCACTGTC
GATGCCGCCC GTCTGGCCAA TGCCGATGCC CTCGGGTTGT TTGCGCCGGT CAAGTCAAAG
CTGCTGGAGA GGCGCATTCC GGCCCACCTG CACACTGACA CCTGGTTTGC CTTCTCGACC
CGGGCCCGCG TCATCATCTA CAACCGCAGC GCGGTCAAGG CCGAGGATGT GGCGACCTAC
GAGTCGCTGG CCGACCCGAA GCTGAAGGGC AAGCTGTGCA GCCGCTCGGG TTCGCATCCG
TACAACCTGT CGTTGGTTGC CTCGCTGATC GCGCATGACG GCGAAGCGAA GACGGAGGAA
TGGGCCAAGG GCATGGTCGC CAATTTCGCC CGGGCGCCTA AGGGCGGCGA TACCGACCAG
ATCAAGTCGG TCGCCCTCGG CGAGTGCGGC GTGGCGGTGT CGAATACCTA CTACCTGGCC
CGCCTGATCC GTTCCGACAA GGTCGATGAG CGCCGCATGA TGGAGCGCGT CGGCATCGTC
TGGCCGAACC AGGCCAATCG CGGCACGCAC ATCAACATTT CCGGTGCCGG CGTGGTCAAG
ACCTCGAAGA ACGTCGAGGC GGCTGTGAAG TTCCTCGAAT ACCTGGCCAG CGACGAGGCT
CAGCGCTACT TCGCCGACGG CAACAATGAA TGGCCGGTGG TGGCCAGCGT GGTGACCGGC
AATCCGGCGC TGGAGGCGAT GGGCAAGTTC AAGGCTGACA CCCTGCCAAT CGGCGTGCTG
GCCAAAAATG TCGTGGCGGC GCAAAAGCTG CTCGACCGCG CCGGTTATCG TTAA
 
Protein sequence
MKKQLALVAL AVLASFATQA QEKVLNLYSA RHYQTDEKLY DNFTKQTGIK INRIDGKEDE 
LMERIRNEGA NSPADVFITV DAARLANADA LGLFAPVKSK LLERRIPAHL HTDTWFAFST
RARVIIYNRS AVKAEDVATY ESLADPKLKG KLCSRSGSHP YNLSLVASLI AHDGEAKTEE
WAKGMVANFA RAPKGGDTDQ IKSVALGECG VAVSNTYYLA RLIRSDKVDE RRMMERVGIV
WPNQANRGTH INISGAGVVK TSKNVEAAVK FLEYLASDEA QRYFADGNNE WPVVASVVTG
NPALEAMGKF KADTLPIGVL AKNVVAAQKL LDRAGYR
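
The sequences above are displayed in space-separated blocks of 10 characters, so they need to be normalized before any computation. The following is a minimal sketch of stripping the layout whitespace and computing GC percentage; the helper names are hypothetical, and no claim is made that this is how the GC content field in this record was derived.

```python
def normalize_sequence(text: str) -> str:
    """Remove all whitespace (block separators, newlines) and uppercase."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA sequence."""
    seq = normalize_sequence(dna)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First displayed line of the gene sequence: six blocks of 10 bases.
first_line = "ATGAAAAAGC AATTAGCCCT GGTCGCGCTG GCTGTGCTGG CGAGTTTCGC AACGCAGGCG"
print(len(normalize_sequence(first_line)))  # 60
```

Passing the full gene sequence to `gc_percent` gives a value that can be compared with the GC content field of the record.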