Gene Tbd_2228 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTbd_2228 
Symbol 
ID3673918 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThiobacillus denitrificans ATCC 25259 
KingdomBacteria 
Replicon accessionNC_007404 
Strand
Start bp2307504 
End bp2309069 
Gene Length1566 bp 
Protein Length521 aa 
Translation table11 
GC content67% 
IMG OID637710933 
Productanthranilate synthase, component I 
Protein accessionYP_315986 
Protein GI74318246 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0147] Anthranilate/para-aminobenzoate synthases component I 
TIGRFAM ID[TIGR00564] anthranilate synthase component I, non-proteobacterial lineages 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGGCGC TCGTCCTGAC AAGACGCGCA GCACATTCCC TGACCCCTGA ATCCCGGATC 
CCGACCCCTG CCATGACCGA ACAAGAATTC TTCGACCTTG CCCGCCAGGG CTTCAACCGT
ATCCCGCTGG TGCGCGAACT GCCCGGCGAC CTCGAGACGC CGCTGTCGGT CTATCTCAAG
CTCGCCAACG CGCCCTATAC CTATCTCCTC GAATCGGTCG TCGGCGGCGA ACGCTTCGGA
CGCTATTCCT TCATCGGCCT GCCGGCGCGC ACCGTGCTCA GGGTGCGCGC CCACCTCCTG
ACGGTCGAAA CCGACGGCCA GGTCGTCGAG GAATGCAGCA CGCCCGATCC GCTCGCTTTC
ATCGCCGAGT ATCAGACCCG CTTCAAGGCC GCGCCGGTCC TGGGCTTGCC GCGCTTCACC
GGCGGCCTGG CCGGCTACTT CGGCTACGAC ACGATCCGTT ACATCGAGAA GCGACTGACG
CCGGAATTCA ACAAGAACCT CGTGCGCAAG GACGACGTCC TGCACACGCC CGACATCCTG
CTGATGCTGA CCGAGGAACT CGCCGTCTTC GACAACCTCG CCGGCAAGCT CTATCTCGTC
ACCCACGCCG ACCCGATGCA GGCGGACGCC TATGCGCAGG GGCACTTGCG GCTCGCGGAG
CTTGCCGAGA AGCTGCACGC GCCGGTCACG CTGCCGAGCG AGCCGCGCCC GGTCGTCGCC
GAAGCGAGCT CCGAATTCGG CGAAGCCGAG TTCAAGGCCG CGGTTGCCAA GGCCAAGGAG
TACATCGCGG CCGGCGACGT CATGCAGGTC GTGCTGTCGC AGCGGATGGC GCGGCCGTTC
GAGGCCTCGC CGCTGTCGCT CTACCGTGCG CTGCGCAGCC TCAACCCGTC GCCCTACATG
TATTTCTACG ACCTTGGCGG CTTCCACATC GTCGGCTCGT CGCCCGAGAT CCTGGTCCGG
CTCGAGGGCG ACACGGTCAC GCTGCGGCCG ATCGCCGGCA CGCGGCCGCG CGGGCTCACG
CGCGAGGACG ATCAGCGCCT CGCGGCCGAA TTGATGGCCG ACCCGAAGGA ATGCGCCGAA
CATCTGCAGC TCCTCGACCT CGGGCGCAAC GACACGGGGC GGGTCGCCGT GACCGGCAGC
GTCAAGGTCA CGGAACACAT GCAGATCGAG CGCTATTCGC ACGTCATGCA CATCGTCTCG
AACGTCGAAG GCAAGCTCAA GCCCGGACTC TCCGCGCTCG ACGTCCTGCG CGCGAGCTTT
CCGGCAGGTA CGGTCTCGGG GGCGCCCAAG GTGCGGGCGA TGGAGATCAT CGACGAGCTC
GAACCGAGCA AGCGCGGGGT CTACGCCGGC GCCGTCGGCT ACCTCGGCTT CAGCGGCGAC
ATGGATCTCG CGATCGCGAT CCGCACGGCC GTCGTCAAGG ATGGAATGCT CTACGCCCAG
GCCGGCGCCG GCATCGTCGC CGACTCGGTC CCCGACAACG AGTGGACCGA GACGCTCAAC
AAGGCGCGGG CGGTCGTGCG TGCGGCCGAG CTCGCCTACG CGCGCTTCGG CGACGTCGTC
GAATAA
 
Protein sequence
MAALVLTRRA AHSLTPESRI PTPAMTEQEF FDLARQGFNR IPLVRELPGD LETPLSVYLK 
LANAPYTYLL ESVVGGERFG RYSFIGLPAR TVLRVRAHLL TVETDGQVVE ECSTPDPLAF
IAEYQTRFKA APVLGLPRFT GGLAGYFGYD TIRYIEKRLT PEFNKNLVRK DDVLHTPDIL
LMLTEELAVF DNLAGKLYLV THADPMQADA YAQGHLRLAE LAEKLHAPVT LPSEPRPVVA
EASSEFGEAE FKAAVAKAKE YIAAGDVMQV VLSQRMARPF EASPLSLYRA LRSLNPSPYM
YFYDLGGFHI VGSSPEILVR LEGDTVTLRP IAGTRPRGLT REDDQRLAAE LMADPKECAE
HLQLLDLGRN DTGRVAVTGS VKVTEHMQIE RYSHVMHIVS NVEGKLKPGL SALDVLRASF
PAGTVSGAPK VRAMEIIDEL EPSKRGVYAG AVGYLGFSGD MDLAIAIRTA VVKDGMLYAQ
AGAGIVADSV PDNEWTETLN KARAVVRAAE LAYARFGDVV E