Gene Tbis_2044 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTbis_2044 
Symbol 
ID9168538 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermobispora bispora DSM 43833 
KingdomBacteria 
Replicon accessionNC_014165 
Strand
Start bp2368972 
End bp2370129 
Gene Length1158 bp 
Protein Length385 aa 
Translation table11 
GC content68% 
IMG OID 
Productaminodeoxychorismate lyase 
Protein accessionYP_003652649 
Protein GI296270017 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0389257 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones49 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGACG TTGACTTGGA CTTCCTCCTC GGGGATGCGG AGGACGAACG CCCGTCCCGG 
CGTCGTCCCC CCGGGAGCCG GGTACAGCAG CGCCGGAGCC GCAAGCGGCG CAGGCGGCAG
CGCCGGAAGG GGTACATCGC GACCGTCTTC GCCATGCTCG TCATCGTCGG CGTCCTCGGC
GGTGGCGTGT ACTACGGCGT CAACGTGGCG CGCGAGGTGC TGACCCCCAA GGACTTCACC
GGCGAGGGGC ATGGCGAGGT GGAGGTCGAG GTCAAGGAAG GGGCGACCGC GACCGACGTC
GCGCAGCTCC TGGAGAAGGA GGGCGTCGTG GCGAGCGCCC GGACGTTCCT CAACGTGATC
GGCGCCGCGG GCAAGACCTC CTCGCTCCAG CCCGGCGTGT ACACGCTGCG CAAGGGCATG
TCGGCCGAGG CGGCCCTCAA AGCGATGCTC GACCCGGGCA ACAAGGTGGT CAACCGGGTC
ACCATCCGGG AAGGGCTGCG GCTGAGCAAG ATCTTCACCG AGCTCTCCAC GGCCACCGGC
AGGCCGGTCG AGGAGTTCCA GAAGGCGGCC AAGGAGGACA TCGGCCTCCC GTCGTACGCC
AAGGGCCGGC TCGAGGGCTT CGCCTTCCCG GCGACCTATG ACATCAGCCC CAAGGACACC
CCCAAGACGA TCCTCTCCCG GATGGTCGAG CGGTTCGTGC AGACCGCGGA GCGCCTCGAT
CTCGAGCGGC GGGCCAAGGA GCTCGGCTAC ACGCCCCGGC AGATAATGAT CATCGCGAGC
ATCGTCCAGG CCGAGTCCGG ACGGCTCGAG GACATGCCGA AGGTCGCCCG GGTGATCTAC
AACCGGCTGA GCCGGAACCC GCCGATGAAG CTGGAGATGG ACAGCACCCT CATGTACGGG
CTCGGCAAGT ACGGCATCGC CGCCACCAAC GAGGACCTCA AAAGCGACAG CCCGTACAAC
ACCTACCGGC GGTACGGCCT GCCCCCGGGC CCGATCTGCA ACCCCGGCGA CCACGCGATC
GAGGCCGCGC TCAATCCCGC CGACGGCAAC TGGCTGTGGT TCGTGACCGT GGACCCGAAG
CGCGGCATCA CCAAGTTCAC CGACAAGGAG TCGGAGTTTT GGAAGCTTCG CGAGGAGTTC
AACCGGAACC GCGGGTGA
 
Protein sequence
MNDVDLDFLL GDAEDERPSR RRPPGSRVQQ RRSRKRRRRQ RRKGYIATVF AMLVIVGVLG 
GGVYYGVNVA REVLTPKDFT GEGHGEVEVE VKEGATATDV AQLLEKEGVV ASARTFLNVI
GAAGKTSSLQ PGVYTLRKGM SAEAALKAML DPGNKVVNRV TIREGLRLSK IFTELSTATG
RPVEEFQKAA KEDIGLPSYA KGRLEGFAFP ATYDISPKDT PKTILSRMVE RFVQTAERLD
LERRAKELGY TPRQIMIIAS IVQAESGRLE DMPKVARVIY NRLSRNPPMK LEMDSTLMYG
LGKYGIAATN EDLKSDSPYN TYRRYGLPPG PICNPGDHAI EAALNPADGN WLWFVTVDPK
RGITKFTDKE SEFWKLREEF NRNRG