Gene Daro_4070 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_4070 
Symbol 
ID3566911 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp4366614 
End bp4367669 
Gene Length1056 bp 
Protein Length351 aa 
Translation table11 
GC content58% 
IMG OID637682542 
Productperiplasmic phosphate binding protein 
Protein accessionYP_287266 
Protein GI71909679 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0226] ABC-type phosphate transport system, periplasmic component 
TIGRFAM ID[TIGR00975] phosphate ABC transporter, phosphate-binding protein 


Plasmid Coverage information

Num covering plasmid clones65 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAATCTC CAATGTCCAG AGCAGCACTG CTAGCCGCCG CGTTGTTTTT ATACGCCAGC 
ACCCTCATCG CCGCAGAACC GATGACCGGT GCCGGGTCGT CTGCCGCCTA CCCGATCTAC
AAGCTGTGGG CGGATCAACT GAAACGTGAA GGCGGTTTCG TGCTGAACTA CGATCCCGTC
GGCTCATCGG CCGGGGTCGA GAGAGCTCGC GCCCGTCAGG CCGATTTCGG CGCCACGGAT
GTCGCACTCA AAAGTGAGCA ACTGGCCAAG GACAACCTTA TCCTCTTTCC AACGGTCATC
ACCGGCGCAG TTCCCGTCAT CAACCTGCCC AAGATGGAAA AACCGCTTGT TCTGGACGGC
CCCACGCTGG CCAGAATCTT CCTGGGGGAA ATTGAGCACT GGGATGCACC TGAGATCCGC
GCCTTGAATC CCGGCCTGAG CCTTCCCGCA AAACCGATAG TCGCGGTCGT CCGCAGCGAT
GGCTCTGGCA CCACTTACAA CTTTGCCGAT TATCTGGCCA AGGTCAGTCC GGCATGGAAA
CAGAAGATGG GCGTTGCCAC GAACCTCAAA TGGCCAGCCT CGTTCACGTC GGTCAAGGGC
AGCAAGGGCG TTGCCGAAGC AGTGAAATCG ACGCCCGGGA GCATCAGTTA CGTGGATTAC
AACTACGTAC TGGATTACAA GCTCACGGGG GCAGCCATGA AGAATGCGGA CGGCGCCATC
GTCGAAGCCG GGCCTTACAC CTTCCGTGAA GCGCTTGCCC AGAGCGTCTG GAAGCAAACC
GGCGACTTCA CTCAGACGCT GACCAATCAG ACCGGGAAAA GCAGCTGGCC CATCACGATG
GGCACTTTTA TCGTCATGCC CCGAGTCTCC AACAATCCGG AACGGACCAT CCAGGTCACC
CGTTTCTTCA CCGAGGCATT CATGCGCGGT GACGACCTTG CGAAACAGGC CAACTTCGTC
CGCCTGCCGT CGATCATCCA GGGGAAAGCC TTCCGGGTGA TTTCGGAAAT CGTCGATGCA
AAGGGCGTTC CCATCGGCAT AAACAGCCTC CGATAA
 
Protein sequence
MQSPMSRAAL LAAALFLYAS TLIAAEPMTG AGSSAAYPIY KLWADQLKRE GGFVLNYDPV 
GSSAGVERAR ARQADFGATD VALKSEQLAK DNLILFPTVI TGAVPVINLP KMEKPLVLDG
PTLARIFLGE IEHWDAPEIR ALNPGLSLPA KPIVAVVRSD GSGTTYNFAD YLAKVSPAWK
QKMGVATNLK WPASFTSVKG SKGVAEAVKS TPGSISYVDY NYVLDYKLTG AAMKNADGAI
VEAGPYTFRE ALAQSVWKQT GDFTQTLTNQ TGKSSWPITM GTFIVMPRVS NNPERTIQVT
RFFTEAFMRG DDLAKQANFV RLPSIIQGKA FRVISEIVDA KGVPIGINSL R