Gene Daro_0081 details

Gene Information

Locus tag: Daro_0081
Symbol:
ID: 3569720
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dechloromonas aromatica RCB
Kingdom: Bacteria
Replicon accession: NC_007298
Strand:
Start bp: 98595
End bp: 99593
Gene Length: 999 bp
Protein Length: 332 aa
Translation table: 11
GC content: 61%
IMG OID: 637678516
Product: ABC transport system substrate-binding protein
Protein accession: YP_283310
Protein GI: 71905723
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components
TIGRFAM ID:
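
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and do not belong to any database API.

```python
# Hypothetical helpers for checking the record's length fields.

def gene_length_bp(start_bp, end_bp):
    # Assumes 1-based, inclusive coordinates.
    return end_bp - start_bp + 1

def protein_length_aa(length_bp):
    # Assumes the gene is a whole number of codons and ends in a stop codon.
    if length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return length_bp // 3 - 1

length_bp = gene_length_bp(98595, 99593)
print(length_bp)                     # 999
print(protein_length_aa(length_bp))  # 332
```

Both results match the Gene Length and Protein Length fields of the record.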


Plasmid Coverage information

Num covering plasmid clones: 66
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACACCC TCAAGTCCGC AAGTCGTACC CGGCGCTGGC TAACCCGACT GGTCCTACCC 
GTCGCTGCTG CCTTCAGCCT CACCGCCCAG GCGGCCCAAC CCCTGAAAAT TGGCTATAGC
GACTGGCCGG GCTGGGTGGC CTGGCAGGTT GCCATCGACA AAGGCTGGTT CAAGGAAGCG
GGCGTCGATG TCAAATTTGA GTGGTTCGAC TATTCGGCCT CGATGGATGC CTTCGCCGCC
GGGAAGATCG ACGCCGTGAC GATGACCAAT GGCGACGCAC TGGTGACTGG CGCTGGTGGT
GCCAAGAGCG TGATGCTGAT GCTGACCGAC TACTCCAACG GCAACGACAT GATCGTCGCC
AAACCGGGCA TCAAGTCGAT CAAGGATCTC AAGGGCAAGA AGGTCGCCGT CGAGCAAGGT
CTGGTCGAAC ACCTGCTGCT GCTCAATGGT CTGAAGAAGG CCGGCATGAA GGAATCGGAC
GTCACGCTGG TCAACGCCAA GACCAACGAA ATGCCCCAGA TGCTGACCGC CAAGGACATC
GCCGCCATCG GCGCCTGGCA ACCGGTATCC GGTGAGGCCA TGAAGGCTGT ACCGGGTTCG
AAGCCGATCT ATACCTCGGC CGACGAAGCC GGCCTGATTT ACGACGTGCT CGCCGCCAAC
CCGGCCAGCG TCAAGGCCCG CCGCGCCGAC TGGCAGAAGG TGGTCAAGGT TTGGGACAAG
GTTGTCGCCT ATATCGAGGA CCCGAAAACC CAGCCCGACG CCGTGAAGAT CATGGCCGCC
CGCTCCGGCA TCAGCCCGGT CGAGTACCTG CCGCTGCTCA AGGGCACCAA GCTGCTCTCC
CTGGAAGAAG GCAAGAAGAT CTACGTCAAG GGCGATGGCT TCAAGACGCT CTACGGCTCG
ACCAAGATCG TCGACTCGTT CAACGTGGCC AACCAGGTTT ACAAGGCCCC GGAGAAGATC
GACGGCTATC TGGATTCCTC CTTCACCAAC GGCAAATAA
 
Protein sequence
MNTLKSASRT RRWLTRLVLP VAAAFSLTAQ AAQPLKIGYS DWPGWVAWQV AIDKGWFKEA 
GVDVKFEWFD YSASMDAFAA GKIDAVTMTN GDALVTGAGG AKSVMLMLTD YSNGNDMIVA
KPGIKSIKDL KGKKVAVEQG LVEHLLLLNG LKKAGMKESD VTLVNAKTNE MPQMLTAKDI
AAIGAWQPVS GEAMKAVPGS KPIYTSADEA GLIYDVLAAN PASVKARRAD WQKVVKVWDK
VVAYIEDPKT QPDAVKIMAA RSGISPVEYL PLLKGTKLLS LEEGKKIYVK GDGFKTLYGS
TKIVDSFNVA NQVYKAPEKI DGYLDSSFTN GK
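
The relationship between the gene sequence and the protein sequence can be reproduced with a short script. This is a minimal sketch, not the database's own method: the helper names are hypothetical, the input is assumed to be plain A/C/G/T text with the 10-base spacing shown above, and the codon table is the standard one. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in the start codons it permits, which does not matter here because the gene starts with ATG.

```python
# Sketch only: helper names are hypothetical, not part of any database API.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# Build the 64-entry codon table from the conventional TCAG ordering.
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def clean(seq):
    # Remove the spaces and line breaks used for display, and uppercase.
    return "".join(seq.split()).upper()

def gc_content(seq):
    # Fraction of bases that are G or C.
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)

def translate(seq):
    # Translate codon by codon, stopping at the first stop codon.
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)

# First 30 bases of the gene sequence, copied with their display spacing.
first_block = "ATGAACACCC TCAAGTCCGC AAGTCGTACC"
print(translate(first_block))  # MNTLKSASRT
print(round(gc_content(first_block), 2))
```

Passing the full gene sequence to `translate` and `gc_content` would allow a comparison with the protein sequence and the GC content field of the record.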