Gene TM1040_0633 details


Gene Information

Locus tag: TM1040_0633
Symbol:
ID: 4076120
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008044
Strand:
Start bp: 675784
End bp: 676980
Gene Length: 1197 bp
Protein Length: 398 aa
Translation table: 11
GC content: 63%
IMG OID: 638005930
Product: aminodeoxychorismate synthase
Protein accession: YP_612628
Protein GI: 99080474
COG category: [E] Amino acid transport and metabolism; [H] Coenzyme transport and metabolism
COG ID: [COG0147] Anthranilate/para-aminobenzoate synthases component I
TIGRFAM ID: [TIGR00553] aminodeoxychorismate synthase, component I, bacterial clade
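
The coordinate and length fields in this record can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. Both assumptions are consistent with the values listed here but are not stated by the record itself. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(675784, 676980)
print(length, protein_length_aa(length))  # prints "1197 398"
```

Both results agree with the Gene Length and Protein Length fields of the record.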


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.284846
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGCGTCGA AACCGCTATA TGCGGGGCAA GGGAGACACT CACTGCTCAC GGAGACCCTG 
CCGGTGGAAA TCCGCTTTGA CAATGGCCCA TCGAAGGGCG CGGCGCTGTT TGAAGACCCA
GTTGACCTGA TCCGCGCGCA AGACCCGGAT GAGGTGCAGG GGGCGCTTGC GGCCTTGGAC
CGCGCCCGCG CCGAGGGGCA TTGGGTTGCA GGCTATGCGT CTTATGAGTT GGGCTATGCG
CTTGAGCCAC GTTTGGCGCA TCTTCTGAAT GGGCGCGGGC ACCGGCGTCT TCCGTTGCTG
CAGTTCGGTG TGTTCCGCGC GCCGGTGGCG GCGGATGTTC CGCTTTGGGC TGGGGATGCA
GGTGTGGGAG AGACGACCGC GCGCTGGGAC GCTGCCCGCT ACACCGAGGC CTTTGATCGC
GTTAAATCCT ACATTGGCGC CGGCGACATA TATCAGGCCA ATCTGACTTT CCCCATCGAC
GCACAGGTCT GGGGCGGGGC GGAGGCGCTT TATGCGGCCC TTGCCGCGCG TCAGCCTGTG
GGCCACGGGG CGCTTGTGCG TCAGGACGGG CTGCCAACGA TCCTGTCGCG CAGCCCGGAG
CTCTTCTTTC GCACATCCTC GGATGGGGTG ATCGAGACGC GTCCCATGAA GGGCACGCAA
CCGCGCAGCC TTGACCCGCG AGAGGATTCG CGACGACGGG ATTTCCTGCG CTCTGACGAA
AAGAACCGCG CTGAAAACCT GATGATTGTC GACCTTCTGC GCAATGACAT CAGCCGCGTG
TCCGAGACCG GCTCGGTCCA TGTGCCAGAG CTGTTTGCCG TGGAAAGCTA TGCGACGGTG
CACCAGATGG TGTCATTGGT GCGGGCACGC CTCAAGGCCG GCTGCGGTCT GGCGGACATC
TTTGCGGCAC TTTATCCCTG CGGGTCGATC ACCGGCGCGC CCAAAATCCG TGCCATGGAG
ATCCTTGCGG AACTCGAGCC CGGGGCGCGC GACATTTATT GCGGCACCAT TGGCTGGGCG
GCCCCCGACG GGCGGTCGGA ATTCAATGTC TCCATACGTA CGATGATGCT GGAGGGCGAT
GCGGCCACGT TCAACGTCGG CGGTGGGCTG GTCTGGGACA GCACCTCCGC CTCCGAGTAT
GAGGAAGCGC TGTGGAAAGC CCGTTTTGCA CAAGTGACGA CCCCGATTTC CGCTTGA
 
Protein sequence
MASKPLYAGQ GRHSLLTETL PVEIRFDNGP SKGAALFEDP VDLIRAQDPD EVQGALAALD 
RARAEGHWVA GYASYELGYA LEPRLAHLLN GRGHRRLPLL QFGVFRAPVA ADVPLWAGDA
GVGETTARWD AARYTEAFDR VKSYIGAGDI YQANLTFPID AQVWGGAEAL YAALAARQPV
GHGALVRQDG LPTILSRSPE LFFRTSSDGV IETRPMKGTQ PRSLDPREDS RRRDFLRSDE
KNRAENLMIV DLLRNDISRV SETGSVHVPE LFAVESYATV HQMVSLVRAR LKAGCGLADI
FAALYPCGSI TGAPKIRAME ILAELEPGAR DIYCGTIGWA APDGRSEFNV SIRTMMLEGD
AATFNVGGGL VWDSTSASEY EEALWKARFA QVTTPISA
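
The relationship between the gene sequence and the protein sequence can be sketched as a codon-by-codon translation under translation table 11. This is a minimal illustration, not the database's own pipeline. It assumes the standard amino-acid assignments (which table 11 shares with the standard code) and reads an alternative initiation codon such as the leading GTG as methionine. `translate_cds` and the other names are hypothetical.

```python
from itertools import product

# Codons in NCBI order (T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Initiation codons permitted by translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(seq: str) -> str:
    """Translate a CDS, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used above) is ignored.
    The first codon is read as M when it is a permitted start codon.
    """
    seq = "".join(seq.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGGCGTCGA AACCGCTATA TGCGGGGCAA"))  # prints "MASKPLYAGQ"
```

The output matches the first ten residues of the protein sequence listed above. Passing the full 1197-base gene sequence to the same function would be the natural way to compare against the complete protein sequence.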