Gene TBFG_12586 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTBFG_12586 
Symbol 
ID5223268 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium tuberculosis F11 
KingdomBacteria 
Replicon accessionNC_009565 
Strand
Start bp2899690 
End bp2903112 
Gene Length3423 bp 
Protein Length1140 aa 
Translation table11 
GC content69% 
IMG OID640607348 
Producthypothetical protein 
Protein accessionYP_001288515 
Protein GI148823761 
COG category[E] Amino acid transport and metabolism
[S] Function unknown 
COG ID[COG1305] Transglutaminase-like enzymes, putative cysteine proteases
[COG4196] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones121 
Plasmid unclonability p-value0.265908 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones195 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCCGCTAC GGCCCACTCA GGTGTCCGGC ACCGGGCGTA CGCGCTGCGC CGGGCGGTCC 
GGTGTGATCT CATCAGCAGC TATGAGCATC AAAGTTGCGC TGGAGCACCG CACCAGCTAC
ACCTTTGACC GGCTGGTGCG GGTGTATCCG CACATCGTGC GGCTACGCCC GGCGCCGCAC
TCCCGCACCT CCATCGAAGC CTACTCGCTG CGCATCGAGC CCGCCGACCA CTTCATCAAC
TGGCAGCAGG ACGCGCTGGG CAACTTTCTG GCGCGGCTGG TCTTTCCGAA TCCCATGCGC
CAACTGCGTA TTACCGTCGG GCTTATCGCC GACCTCAAGG TGATCAACCC CTTCGACTTC
TTTATCGAGG ACTGGGCCGA GATATGGCCC TGCGCAGGGA TGGCCTACCC CAAGGCGCTC
GCCGATGACC TGAGGCCGTA CTTGCGGCCG GTCGACGAAG ACGGCGACGG TTCGGGCCCC
GGCGAGCTCA CGCAGGCCTG GGTGCGCAAC TTCACGGTGC CCGATGGCAC CCGCACCATC
GACTTCTTGG TCGCACTCAA CCGCGCGATC AACGCCGACG TCGGCTACTG CGTGCGCATG
GAGCCCGGAG TTCAGACACC GGATTTCACG CTGCGCACCG GCGTCGGCTC GTGCCGGGAC
TCGGCGTGGC TGCTGGTCTC GATCCTGCGT CAGTTCGGGC TGGCCGCCCG GTTCGTGTCC
GGCTACCTGG TTCAGCTGGC ATCCGACATC GAAGCGCTCG ACGGGCCGTC GGGGCCCGCC
GCCGACTTCA CCGACCTGCA CGCGTGGGCC GAGGCATACA TCCCGGGTGC CGGCTGGATC
GGGCTGGACC CGACGTCGGG GCTGTTGGCC GGCGAGGGCC ACATTCCGCT GGCGGCTACG
CCCCACCCCG CCAGCGCGGC ACCCATCAGC GGCGGCACCG ACGTGTGCGA CACCGTGCTG
GAGTTCTCCA ACACCGTCAC CCGCGTACAC GAAGACCCAC GTGTCACGTT GCCCTACACC
GACGAGTCCT GGAAGACCAT CTGTGAGGTG GGCCAGCGCG TCGATGAGCG GCTGGCCGCC
GCCGACGTCC GGCTGACCGT CGGCGGCGAA CCGACGTTCG TGTCGGTGGA TAACCAGGTC
GCCGAAGAGT GGCGGACGGC GGCCGACGGC CCACACAAAC GCGAACGGGC ATCCGACCTG
GCCGCCCGCT TGAAGGCGGT GTGGGCCCCG CAGGGACTCA TCCACCGCGG TCAGGGCAGG
TGGTATCCCG GAGAGCCGTT GCCGCGCTGG CAGATTGCGC TGTATTGGCG CACCGACGGG
CGGCCGCTGT GGACCAACGA CGCGCTGTTG GCCGACCCCT GGGGCGCCCC GCCCGCCGAC
CCCGTCGACG ACGACGCGGC CTACCGGGTG CTCGCCGGGA TCGCCGACGG CTTGGGGCTG
CCGATCTCGC AGGTGCGGCC CGCCTACGAA GACCCGTTGA GCCGGCTGGC TGCGGCCGTG
CGAATGCCAG CCGGCGACCC GGTGGAATCC GGTGACGACC TCGGCTGCGA CACCAACCCC
GACACCCCCA CCGGCCGCGC CGCGCTGCTG GCGCGCCTCG ATGAGGCCAT CACCTCTCCG
GCTGCGTACG TGCTGCCGCT GCACCGCCGC GACGACGGGC AAGGCTGGGC CAGCGCGAAC
TGGCGGCTGC GCCGCGGTCG CATCGTGTTG CTCGAAGGGG ATTCGCCGGC GGGCCTGCGG
CTGCCGCTGG ATTCGATCAG CTGGCGCCCA CCCCGGGCAT CGTTTGACGC CGACCCGGTA
GCTGTGCGAT CCACATTGCC GGCGGAGCCC CACACCGACC GGGCCGTAGT GGAGGATCCC
GAGACGGCTC CGACCACCGC GTTGGTCGCC GAGGTCCGGG GTGGGCTGGT GCACATCTTC
TTGCCGCCCA CCGACGCGCT CGAGCACTTC ATCGACCTTG TCGCCCGAGT CGAGGCCGCG
GCGACGACGG CCAACTGCCC GGTGGTGATC GAGGGCTACG GCCCACCCCC GGACCCGCGG
CTGACGTCCA CCACAATCAC CCCCGACCCC GGCGTCATCG AGGTCAACAT CGCGCCCACC
GCCTCTTTTG CAGAACAACG GCAACAGCTG GAAACCCTGT ATCAACAAGC GCGCCTGGCC
CGACTCACCA CCGAAGCGTT CGACGTCGAC GGCACGCACG GCGGCACCGG CGGCGGCAAC
CACATCACGC TTGGCGGCGT CACACCCGCG GACTCACCGC TGCTGCGCCG GCCCGACCTG
CTGGTTTCAC TGCTGACCTA CTGGCAGCGA CACCCGTCGT TGTCCTACTT GTTCGCCGGG
CGTTTCGTCG GCACCACGTC ACAGGCGCCC CGGGTTGACG AGGGCCGCGC CGAGGCGCTC
TACGAACTCG AGATCGCGTT CGCCGAGATC CTCCGGCTGT CGCCGTCGTC CGGGGGCGGC
CGGCCCCAAC CGTGGGTGAC CGACCGCGCG CTGCGGCACC TGCTCACCGA CATCACCGGC
AACACCCATC GCGCCGAATT CTGCATCGAC AAGCTCTACA GCCCCGACAG CGCCCGGGGC
AGGCTCGGCC TGCTGGAGCT CCGCGGGTTC GAGATGCCGC CGCACCTGCA CATGGCGATG
GTGCAGTCGC TGCTGGTGCG CTCGCTGGTG GCGTGGTTCT GGGACCAACC GCTGCGCGCC
CCGCTGATCC GCCACGGCGC CAACTTGCAC GGTCGATATC TATTGCCGCA CTTCTTGATT
CATGACATCG CCGACGTCGC AGCCGACCTG CGCGCGCACG GCATCGCGTT CGAGACTAGC
TGGCTGGACC CGTTCACCGA GTTCCGCTTC CCGCGCATCG GCACCGCCGT ATTCGACGGC
ATTGAGATCG AGCTGCGCGG GGCCATCGAG CCATGGCACA CCCTTGGCGA GGAGGCCACC
GCGGCAGGCA CCGCGCGCTA TGTCGACTCG TCGGTCGAGC GCATCCAGGT CCGCATCATC
GGCGCCGACC GGCACCGCTA CGTGGTGACC TGTAACGGCT ACCCGATGCC GTTGCTGGCT
ACCGACAACC CCGACATCCA CGTGGGTGGT GTGCGGTTCA AAGCGTGGCA GCCGCCCAGC
GCGCTACACC CGACCATCAC GGTCGACGGC CCGTTGCGGT TCGAGCTCAT CGACATCGCC
ACCGCTACCT CGTGCGGCGG CTGTACCTAC CATGTCGCCC ATCCGGGCGG CCGCGCCTAC
GACGAGCCCC CGGTCAACGC CGTGGAGGCG GAGGCCCGCC GCGCCCGGCG CTTCGAGGCG
ACCGGCTTCA CCCCGGGCAA GCTCGACCTG TCCGACATCC GGGAGAAACA GGCCAGGATA
TCCACCGATA TCGGCGCGCC GGGCATCCTC GACCTACGAC GCGTGCGTAC CGTGCAACAG
TAA
 
Protein sequence
MPLRPTQVSG TGRTRCAGRS GVISSAAMSI KVALEHRTSY TFDRLVRVYP HIVRLRPAPH 
SRTSIEAYSL RIEPADHFIN WQQDALGNFL ARLVFPNPMR QLRITVGLIA DLKVINPFDF
FIEDWAEIWP CAGMAYPKAL ADDLRPYLRP VDEDGDGSGP GELTQAWVRN FTVPDGTRTI
DFLVALNRAI NADVGYCVRM EPGVQTPDFT LRTGVGSCRD SAWLLVSILR QFGLAARFVS
GYLVQLASDI EALDGPSGPA ADFTDLHAWA EAYIPGAGWI GLDPTSGLLA GEGHIPLAAT
PHPASAAPIS GGTDVCDTVL EFSNTVTRVH EDPRVTLPYT DESWKTICEV GQRVDERLAA
ADVRLTVGGE PTFVSVDNQV AEEWRTAADG PHKRERASDL AARLKAVWAP QGLIHRGQGR
WYPGEPLPRW QIALYWRTDG RPLWTNDALL ADPWGAPPAD PVDDDAAYRV LAGIADGLGL
PISQVRPAYE DPLSRLAAAV RMPAGDPVES GDDLGCDTNP DTPTGRAALL ARLDEAITSP
AAYVLPLHRR DDGQGWASAN WRLRRGRIVL LEGDSPAGLR LPLDSISWRP PRASFDADPV
AVRSTLPAEP HTDRAVVEDP ETAPTTALVA EVRGGLVHIF LPPTDALEHF IDLVARVEAA
ATTANCPVVI EGYGPPPDPR LTSTTITPDP GVIEVNIAPT ASFAEQRQQL ETLYQQARLA
RLTTEAFDVD GTHGGTGGGN HITLGGVTPA DSPLLRRPDL LVSLLTYWQR HPSLSYLFAG
RFVGTTSQAP RVDEGRAEAL YELEIAFAEI LRLSPSSGGG RPQPWVTDRA LRHLLTDITG
NTHRAEFCID KLYSPDSARG RLGLLELRGF EMPPHLHMAM VQSLLVRSLV AWFWDQPLRA
PLIRHGANLH GRYLLPHFLI HDIADVAADL RAHGIAFETS WLDPFTEFRF PRIGTAVFDG
IEIELRGAIE PWHTLGEEAT AAGTARYVDS SVERIQVRII GADRHRYVVT CNGYPMPLLA
TDNPDIHVGG VRFKAWQPPS ALHPTITVDG PLRFELIDIA TATSCGGCTY HVAHPGGRAY
DEPPVNAVEA EARRARRFEA TGFTPGKLDL SDIREKQARI STDIGAPGIL DLRRVRTVQQ