Gene TBFG_12837 details

Gene Information

Locus tag: TBFG_12837
Symbol:
ID: 5223523
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium tuberculosis F11
Kingdom: Bacteria
Replicon accession: NC_009565
Strand:
Start bp: 3140973
End bp: 3143411
Gene Length: 2439 bp
Protein Length: 812 aa
Translation table: 11
GC content: 61%
IMG OID: 640607603
Product: hypothetical protein
Protein accession: YP_001288766
Protein GI: 148824012
COG category: [R] General function prediction only
COG ID: [COG1353] Predicted hydrolase of the HD superfamily (permuted catalytic motifs)
TIGRFAM ID: [TIGR02578] CRISPR-associated protein, Csm1 family
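
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, which is consistent with the listed gene length, and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 3140973
end_bp = 3143411
listed_gene_length = 2439      # bp
listed_protein_length = 812    # aa

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of three bases; the last codon is the stop
# codon and contributes no amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, codon_count, protein_length)
```

Under these assumptions the computed gene length and protein length agree with the listed values.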


Plasmid Coverage information

Num covering plasmid clones: 217
Plasmid unclonability p-value: 0.867182
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 208
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACCCGC AACTCATCGA GGCCATAATC GGCTGCCTCT TGCACGACAT TGGCAAACCG 
GTCCAGCGCG CGGCGCTCGG CTACCCGGGC AGGCACAGTG CGATTGGCCG CGCTTTTATG
AAGAAGGTGT GGTTGCGCGA CAGCCGCAAT CCGTCGCAGT TCACCGACGA GGTGGATGAG
GCTGACATTG GGGTCTCCGA CCGCCGCATT CTCGACGCGA TCAGCTATCA CCACAGTTCT
GCGCTGCGTA CGGCGGCCGA GAATGGCCGC CTTGCCGCCG ATGCGCCGGC CTACATCGCC
TACATCGCCG ACAATATCGC GGCCGGAACC GACCGCCGCA AGGCCGACTC CGACGACGGC
CATGGTGCGA GCACTTGGGA TCCGGACACG CCCCTGTATT CGATGTTCAA CCGATTCGGC
TCCGGCACAG CGAATCTGGC ATTTGCCCCG GAGATGCTCG ACGACCGCAA GCCGATCAAT
ATACCGTCGC CACGCCGGAT CGAATTCGAC AAGGACCGCT ACGCCGCCAT CGTCAACAAA
CTTAAAGCCA TTCTGGTCGA CCTCGAACGT TCCGACACCT ACCTCGCCAG CCTCCTCAAC
GTCCTCGAGG CGACGCTGTC GTTCGTGCCG TCCTCGACCG ACGCGTCCGA GGTCGTCGAC
GTCTCACTCT TCGACCACCT GAAGCTGACG GGTGCGCTCG GCGCCTGCAT CTGGCACTAC
CTACAAGCCA CCGGACAAAG CGACTTCAAG TCAGCGCTGT TCGACAAGCA GGACACCTTC
TACAACGAAA AAGCCTTCCT GCTCACAACT TTCGACGTCT CAGGCATCCA GGACTTCATC
TACACGATCC ATTCCTCGGG TGCCGCGAAG ATGCTGCGTG CCCGCAGCTT CTACCTGGAG
ATGCTGACCG AGCATCTCAT CGACGAGCTA CTTGCGCGGG TGGGTCTCAG CCGCGCGAAT
CTCAACTACT CCGGCGGCGG GCACGCGTAC CTGCTGCTGC CCAACACGGA GTCCGCGCGG
AAATCCGTCG AACAGTTCGA GCGTGAGGCC AACGACTGGC TGCTGGAAAA CTTCGCAACC
CGGCTCTTCA TCGCCACGGG TAGCGTACCG CTTGCCGCGA ACGACCTGAT GCGTCGGCCG
AACGAGAGTG CGAGCCAGGC AAGTAACCGC GCCCTCCGCT ACAGCGGGCT CTACCGTGAG
TTGAGCGAGC AACTTTCCGC GAAGAAGCTC GCCCGATACA GCGCTGACCA ACTGCGGGAA
CTCAACTCGC GCGATCACGA CGGTCAGAAA GGTGACCGGG AATGCAGCGT GTGCCACACG
GTCAACCGCA CGGTCAGCGC CGACGACGAG CCAAAGTGCA GCCTGTGCCA AGCGCTGACC
GCTGCGTCTT CGCAGATTCA ATCCGAGTCT CGCCGCTTCC TACTCATCTC TGACGGCGCC
ACCAAAGGTC TGCCCCTGCC GTTCGGCGCC ACACTCACGT TCTGTAGCCG AGCCGACGCC
GATAAGGCAC TCCAGCAACC CCAAACCCGA AGGCGGTACG CGAAGAACAA GTTCTTCGCC
GGCGAGTGTT TGGGCACCGG GCTCTGGGTG GGCGACTACG TCGCACAGAT GGAGTTCGGT
GACTACGTGA AGCGTGCGAG CGGAATCGCG CGCCTCGGGG TTCTGCGCCT TGACGTCGAT
AACCTGGGCC AGGCATTCAC GCACGGCTTC ATGGAGCAAG GCAACGGCAA GTTCAACACG
ATTAGCCGCA CGGCCGCGTT CTCCCGGATG CTGTCGTTGT TCTTCCGGCA GCACATCAAC
TACGTGTTGG CACGCCCGAA ACTGCGCCCG ATCACCGGCG ATGACCCGGC GCGGCCCCGC
GAGGCCACGA TCATCTACTC CGGTGGCGAT GACGTCTTCG TCGTGGGCGC GTGGGACGAC
GTCATCGAGT TCGGGATCGA GCTTCGGGAG CGGTTCCACG AATTCACCCA GGGCAAACTC
ACCGTGTCGG CTGGCATCGG CATGTTCCCC GACAAGTACC CCATCTCCGT GATGGCCCGC
GAAGTCGGAG ATCTCGAAGA CGCGGCGAAG TCGCTGCCCG GCAAGAACGG GGTTGCACTC
TTCGATCGCG AGTTCACCTT CGGCTGGGAT GAGCTGCTCA GCAAGGTGAT CGAGGAGAAG
TACCGGCACA TCGCCGACTA TTTCAGTGGC AACGAAGAAC GCGGCATGGC CTTCATCTAC
AAGCTGCTCG AACTACTCGC CGAACGCGAC GATCGAATCA CAAAGGCCAG ATGGGTGTAC
TTCCTCACGC GCATGCGTAA CCCCACCGGT GACACAGCGC CTTTTCAGCA GTTTGCTAAC
CGGCTACACC AATGGTTCCA AGATCCGACA GACGCCAAGC AACTCAAGAC CGCGCTGCAC
CTCTACATCT ATCGCACTCG CAAGGAGGAG TCCGAATGA
 
Protein sequence
MNPQLIEAII GCLLHDIGKP VQRAALGYPG RHSAIGRAFM KKVWLRDSRN PSQFTDEVDE 
ADIGVSDRRI LDAISYHHSS ALRTAAENGR LAADAPAYIA YIADNIAAGT DRRKADSDDG
HGASTWDPDT PLYSMFNRFG SGTANLAFAP EMLDDRKPIN IPSPRRIEFD KDRYAAIVNK
LKAILVDLER SDTYLASLLN VLEATLSFVP SSTDASEVVD VSLFDHLKLT GALGACIWHY
LQATGQSDFK SALFDKQDTF YNEKAFLLTT FDVSGIQDFI YTIHSSGAAK MLRARSFYLE
MLTEHLIDEL LARVGLSRAN LNYSGGGHAY LLLPNTESAR KSVEQFEREA NDWLLENFAT
RLFIATGSVP LAANDLMRRP NESASQASNR ALRYSGLYRE LSEQLSAKKL ARYSADQLRE
LNSRDHDGQK GDRECSVCHT VNRTVSADDE PKCSLCQALT AASSQIQSES RRFLLISDGA
TKGLPLPFGA TLTFCSRADA DKALQQPQTR RRYAKNKFFA GECLGTGLWV GDYVAQMEFG
DYVKRASGIA RLGVLRLDVD NLGQAFTHGF MEQGNGKFNT ISRTAAFSRM LSLFFRQHIN
YVLARPKLRP ITGDDPARPR EATIIYSGGD DVFVVGAWDD VIEFGIELRE RFHEFTQGKL
TVSAGIGMFP DKYPISVMAR EVGDLEDAAK SLPGKNGVAL FDREFTFGWD ELLSKVIEEK
YRHIADYFSG NEERGMAFIY KLLELLAERD DRITKARWVY FLTRMRNPTG DTAPFQQFAN
RLHQWFQDPT DAKQLKTALH LYIYRTRKEE SE
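
The sequences above are printed in space-separated blocks of ten residues. A minimal sketch of how such a block could be cleaned up, measured for GC fraction, and translated is given below. It uses only the first and last lines of the gene sequence as sample input. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, and the codon table is the standard NCBI codon assignment, which translation table 11 shares for internal codons (table 11 differs in which codons may act as start codons, which does not matter here because the gene starts with ATG).

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def clean(blocked: str) -> str:
    """Remove the spaces and line breaks used for display."""
    return "".join(blocked.split()).upper()

def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)

def translate(seq: str) -> str:
    """Translate in frame from position 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First and last lines of the gene sequence, copied from the record.
first_line = clean("ATGAACCCGC AACTCATCGA GGCCATAATC GGCTGCCTCT TGCACGACAT TGGCAAACCG")
last_line = clean("CTCTACATCT ATCGCACTCG CAAGGAGGAG TCCGAATGA")

print(translate(first_line))    # MNPQLIEAIIGCLLHDIGKP
print(translate(last_line))     # LYIYRTRKEESE
print(gc_fraction(first_line))  # 0.55 for this 60-base sample only
```

The two translated fragments match the start and end of the protein sequence listed above. The GC fraction shown is for the 60-base sample, not for the whole gene; passing the full cleaned gene sequence to `gc_fraction` would give the gene-wide figure.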