Gene Daro_2398 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_2398 
Symbol 
ID3567582 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp2585030 
End bp2586472 
Gene Length1443 bp 
Protein Length480 aa 
Translation table11 
GC content51% 
IMG OID637680865 
Productlipopolysaccharide biosynthesis 
Protein accessionYP_285604 
Protein GI71908017 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis 
TIGRFAM ID[TIGR03007] polysaccharide chain length determinant protein, PEP-CTERM locus subfamily 


Plasmid Coverage information

Num covering plasmid clones39 
Plasmid unclonability p-value0.0820285 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTGATC GCTACGAAGC CTCAGCCCGT ATCTACGTGG ATACACAATC CATTCTTAAG 
CCGTTGATGT CCGGACTGAC CGTTCAGCCG AACATAGAAC AACAAGTGAT GATGCTCAGT
CGAACGCTAA TCAGCCGACC CAATATCGAA AAATTGATTC GGATGGCTGA TCTTGACCTA
AAGATTCAGG GGAAACGCGA ACAAGAAGCA CTGATTGACG AGTTAATGAA GACTCTCGTG
ATTCAAAGTT TGGGCCGCGA CAACCTGTAC ACCATCGCCT ACCGCGACAC TGATCCTTCA
AAGGCGCAAC GGGTCGTACA GGCTCTGGTG TCGATTTTTG TTGAATCAAG CTTAGGGGAC
AAGCGGCAGG ACAGTGACTC CGCACTAAAA TTCATTGATG AACAAATAAG GAACTACGAA
AAGAAACTCG AAGATGCAGA AAGCCGCCTG AAGGATTTCA AGCTTCGCAA CGTTGAGCTG
AATACCGGCG AAGGCAGAAG CGGCATTGAT AAACTCTCCG AATTGACCAA TGTTTTGAAT
AGTTCAAGAC TTGCACTTCG CGAAGCCGAG AATTCACGTG ATGCACTGCG CAGGCAAATC
CTCGGTGAAG AGCCGGTGTT GTTGCCAGAA TCATCTAGCG GCGACTCAGG TGTTTCATTA
CCCGAGATTG ATGGCAGACT GGATGTACAG AAGCGCAACC TTGACAATTT ATTACAGCGC
TACACTGATC AACACCCCGA TGTCATTGGA ACACGTCGAC TGATCAAGGA TATTGAAGAG
CAGAAACGTC AGGAGCTCTT GGCTCGCAAA AAATTTGCAG CGGCCAATCC TGGAGCCTCC
GTCAGCAACA ACCCCGTGTA TCAACAACTA AAGGTGTCGT TAGCCGAGTC CGAAGCCAAT
GTAGCCTCGT TGCGCGCCCG TGTCAGCGAG TACGAAACTC GTTACAAGCG CACTACTGAC
TTGCTCAAGA CACAACCGCA ACTCGAAGCA GAATATACCC AACTTAATCG CGACTACGAC
ATTCACAAGA AAAACTACGA ACAGCTGGTT ACCCGCCGAG AAGCCGCCGA ACTGTCCGGC
GATCTTGAGT CAACAGGTTC TGGCGCTGAC TTTCGGCTAA TCGACCCGCC ACGGGCTTCG
TCAAAGCCTG TAGCCCCCAA CCGCCTCTTG CTCCTTCCTG GCGGACTTGC CTTGGCTTTG
GCTGCGGGCT TGTTTGTTGC ATTTGTCGCT AGCCAGATTC GCCCGGTATT CTTTGATGGC
AAGACCCTTC GCGAAGTATC AGGCTTGCCT CTTCTTGGCA CGGTTTCACT ACTGCCAAAC
CCCGTTCGAA AACAGAAGGA ACGAGCAAGT CTGAAGAGAT TTCTCATCGC CACGTTTGGC
CTAATTTCAG CGTATGGCTT TGGCATAGCC GCGCTTTTTA TACTCACCCA ACGCGCAGCC
TGA
 
Protein sequence
MPDRYEASAR IYVDTQSILK PLMSGLTVQP NIEQQVMMLS RTLISRPNIE KLIRMADLDL 
KIQGKREQEA LIDELMKTLV IQSLGRDNLY TIAYRDTDPS KAQRVVQALV SIFVESSLGD
KRQDSDSALK FIDEQIRNYE KKLEDAESRL KDFKLRNVEL NTGEGRSGID KLSELTNVLN
SSRLALREAE NSRDALRRQI LGEEPVLLPE SSSGDSGVSL PEIDGRLDVQ KRNLDNLLQR
YTDQHPDVIG TRRLIKDIEE QKRQELLARK KFAAANPGAS VSNNPVYQQL KVSLAESEAN
VASLRARVSE YETRYKRTTD LLKTQPQLEA EYTQLNRDYD IHKKNYEQLV TRREAAELSG
DLESTGSGAD FRLIDPPRAS SKPVAPNRLL LLPGGLALAL AAGLFVAFVA SQIRPVFFDG
KTLREVSGLP LLGTVSLLPN PVRKQKERAS LKRFLIATFG LISAYGFGIA ALFILTQRAA