Gene Tbis_0414 details

Gene Information

Locus tag: Tbis_0414
Symbol:
ID: 9166895
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermobispora bispora DSM 43833
Kingdom: Bacteria
Replicon accession: NC_014165
Strand:
Start bp: 463890
End bp: 466301
Gene Length: 2412 bp
Protein Length: 803 aa
Translation table: 11
GC content: 70%
IMG OID:
Product: NHL repeat containing protein
Protein accession: YP_003651035
Protein GI: 296268403
COG category:
COG ID:
TIGRFAM ID:
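
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 463890
end_bp = 466301

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and adds no residue.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp)     # 2412
print(protein_length_aa)  # 803
```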


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.316951
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 41
Fosmid unclonability p-value: 0.387155
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGAGAAAGC GCCCGGCCGC GGCGTTCGCC GCGGCGCTCT TACTCGCGGT GGCGGCGGGC 
CTGGTCCGGC CCGCCGCCGA GGCCGGCGCG GCCACCGTGC TCACCTTCGC CGACGAGGAC
CGCTCGTCCG AGCTGACCGG GCACGTCGTC CACCAGCGGT ACCAGTTCAA CACGCCCACC
ACGGTCACCT ACGACCGGGA CACCATCACC GGCAACCCCG CCGCGGCCCG GGTCTACGTG
GCCGACATGG GCAACCACGT GATCCGCGTC TTCGACCTCA ACGGCAAGCA GATCGGCCGG
CTCGACGACG CGGACACCCA GCTGGCCCCG GACAGCCCGG CGAGCTCGGT GCCGCAGATC
ACCGCGCCGC TCGGCATCTA CTTCCTCTCC AAGAGCGAGG CCGTCGACGA CCGGCTGGCC
GGGCTCTACA TCAACGACGT CGGCGTGCAC AAGCTGCACT TCTTCCGCAC CGACCCGTCC
AACCCGGACC GGTTCTACTA CGTGACCTCC TTCGGCCAGG AGGGCCACGG CGGCGGCGCC
GACCTGAAGC TGCCGCGGAA CATGACGGTC ACCCCGGACG GCCTGCTGTA CGTCTCCGAC
GAGTTCAACC ACCGGATCAA GGGCTTCCGG ATCGACCCGG ACACCTGGAC CGCGACCCTC
GTCACCACGG TCGGCTCCCA GGGCGGCCCC ATCATCCCCG GCACGGACAA GGACTACGGC
ACCGACTCCA CCCACTACGA CGACTACGCG GGCGAGCCGC TCAAGCGGGA CGGCTTCCGC
ATCCCGCAGG GCATGACCTA CTGGCGGACG CCGGACGGCT CCCGCACCTA CCTCTACGTC
GCCGACAACG GCAACAACCG CGTCAAGATC TTCGAGGTCG CGGCGAGCGG CACGCTCACC
CTCGTCGACA TCCTCGGCCG GTTCACGCGG AACGGCACCG CCGACCACCT CAAGCGCCCC
CGCGGCGTCC GGGTCGACGT GAACGGCAAC CTCTACGTCG CCGACACCTA CGGCGGCCGG
ATCATCCGCT TCCCGAACCT CGGCACCAAC ACCGCCAAGT ACCGCACCTC GCTCAGCGCG
GACGCCGCCG CCTCCTGGGT GTACGGCCGG CTCGGCATCC ACCAGGTGGA GATGCGCACC
CCCGCCACCG CGCTCACCGA GGACGAGGCG TTCCAGCTCC CCAACGACGT GGTCCCGGTG
GAGACCCCGA GCGGCGCCCG GTACACCGAG AACATCTGGT CGTGGGGCGT CTACTACCCC
GGCGCGCGGG TGCTGCTGGT GAGCGACACC GGCAACCACC GGATCAAGAA GTGCTGGGAG
CACCCCACCC AGAACACGAT CCTCCGTTGC TCGGTCTCGG CCGGCGTCGG CGGGGTCACC
GCCCACGAGT TCTGGGGCCA CCCGCGCACG CTCGCCGGCC AGCTCCACGC GGTGGGCGGC
ATGGACCTGC TGCCCGGGCA GGGGAGCGAC CCCGACACCC TGCTCGTCAG CGACACCCCC
AACACCGTCA TCTACCGCTA CGGGCTCGAC GGGTCGTACA AGGGCAAGTT CACCGGCGGC
TCGATCTCGT ACGGGGTCAC CGGGCTGAGC GTCTACCCGG TCTCCGGGAG CCACCACGTC
GGCGTGCTCG TCGCCGCCGA CGCGACCCTG CCCTACCCGT ACACCGGGGA CAGCTCGCTG
CGCATCTACA ACCGCGCCGG CGGCTCCGTC AACGTCTTCA ACCTCACCAC CCGCACCTCC
GGCGCCTCGA AGATCAGCTA CACCGGCGGG AACTTCCCGG TGGCGATCGA CATCGTGCCG
GAGGGCGGCT CGTACGGGGT GTTCATCAGC ACCTCCGGCA ACCGGCTCTA CCGGTTCACC
CTGAGCGGCT CCTCGCTCAC GCTCAACTGG GTGACCGGCG GCCCCGACCC CAGCAAGGGG
TCCGACTCCG GCTCGACCTG GAACCTCGGC CCGAACTTCT ACGGCGAGGG CGCGGCCGGC
ACGTTCGACC AGATCCAGGA CGTCACCGCC GGCGGCGGCC GGGTCTACGC GGTGGACCGG
CGCAACCAGC GGATCCAGGT CTTCAACGCC TCCACCGGCG CCTACATCGC CAAGATCGGC
AAGGGTGGCG GCACCTACGA CCACCCGGCG TCGATCACCC CCGACGAGTT CTTCCTCCCG
CACGGCGTGC GGCTCGACGG CGGCCTGCTG GTCGGCGACG GGTTCAACAT GATCGTCCGC
GACTACGACG ACCCGACCGG GCTGACGCCC GACTCCTCCG GCCGGCTGCC GGTGACCATG
CGCGGCTACT GGGTCGACCC GCACCTCGGC ACCCGCAAGG GCGGCCTGTT CGCCACCCAG
CACGTGCTGC GCGCCGGGCC GTACGTGTTC GTGGACTCCC TGATCTCGAA TCGCATCACC
CGGATCTCAT AG
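
The gene sequence above is printed in space-separated blocks of 10 bases, so it needs light cleaning before any computation such as the GC content reported in the Gene Information section. The following is a minimal sketch, assuming the sequence text has been copied into a Python string. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the sequence is used here as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence above, used as sample input only.
first_line = "ATGAGAAAGC GCCCGGCCGC GGCGTTCGCC GCGGCGCTCT TACTCGCGGT GGCGGCGGGC"
seq = clean_sequence(first_line)

# Prints the cleaned length and the GC percentage of this sample line.
print(len(seq), round(gc_fraction(seq) * 100))
```

Passing the full 2412 bp sequence through the same two functions gives the whole-gene value to compare with the GC content field.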
 
Protein sequence
MRKRPAAAFA AALLLAVAAG LVRPAAEAGA ATVLTFADED RSSELTGHVV HQRYQFNTPT 
TVTYDRDTIT GNPAAARVYV ADMGNHVIRV FDLNGKQIGR LDDADTQLAP DSPASSVPQI
TAPLGIYFLS KSEAVDDRLA GLYINDVGVH KLHFFRTDPS NPDRFYYVTS FGQEGHGGGA
DLKLPRNMTV TPDGLLYVSD EFNHRIKGFR IDPDTWTATL VTTVGSQGGP IIPGTDKDYG
TDSTHYDDYA GEPLKRDGFR IPQGMTYWRT PDGSRTYLYV ADNGNNRVKI FEVAASGTLT
LVDILGRFTR NGTADHLKRP RGVRVDVNGN LYVADTYGGR IIRFPNLGTN TAKYRTSLSA
DAAASWVYGR LGIHQVEMRT PATALTEDEA FQLPNDVVPV ETPSGARYTE NIWSWGVYYP
GARVLLVSDT GNHRIKKCWE HPTQNTILRC SVSAGVGGVT AHEFWGHPRT LAGQLHAVGG
MDLLPGQGSD PDTLLVSDTP NTVIYRYGLD GSYKGKFTGG SISYGVTGLS VYPVSGSHHV
GVLVAADATL PYPYTGDSSL RIYNRAGGSV NVFNLTTRTS GASKISYTGG NFPVAIDIVP
EGGSYGVFIS TSGNRLYRFT LSGSSLTLNW VTGGPDPSKG SDSGSTWNLG PNFYGEGAAG
TFDQIQDVTA GGGRVYAVDR RNQRIQVFNA STGAYIAKIG KGGGTYDHPA SITPDEFFLP
HGVRLDGGLL VGDGFNMIVR DYDDPTGLTP DSSGRLPVTM RGYWVDPHLG TRKGGLFATQ
HVLRAGPYVF VDSLISNRIT RIS
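
The protein sequence above can be reproduced from the gene sequence by codon translation. The sketch below is a plain-Python illustration rather than the database's own method. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ only in permitted start codons), and the `translate` helper is a hypothetical name. Only the first 30 bases of the gene are used as input.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first, second, third base).
# Table 11 shares these assignments with the standard code.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
print(translate("ATGAGAAAGC GCCCGGCCGC GGCGTTCGCC"))  # MRKRPAAAFA
```

The output matches the first 10 residues of the protein sequence listed above.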