Gene TBFG_11020 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTBFG_11020 
Symbol 
ID5221695 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium tuberculosis F11 
KingdomBacteria 
Replicon accessionNC_009565 
Strand
Start bp1120953 
End bp1122161 
Gene Length1209 bp 
Protein Length402 aa 
Translation table11 
GC content66% 
IMG OID640605772 
Productarginine deiminase 
Protein accessionYP_001286965 
Protein GI148822211 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2235] Arginine deiminase 
TIGRFAM ID[TIGR01078] arginine deiminase 


Plasmid Coverage information

Num covering plasmid clones280 
Plasmid unclonability p-value0.00481648 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones194 
Fosmid unclonability p-value0.252626 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGGTGTCG AATTGGGGTC AAATTCCGAG GTCGGCGCGC TAAGAGTGGT CATCCTGCAC 
CGCCCGGGGG CCGAACTGCG CCGGCTCACA CCGCGCAACA CCGACCAGCT GCTGTTCGAC
GGCCTGCCCT GGGTATCCCG CGCGCAGGAC GAGCACGACG AATTCGCCGA GCTGCTGGCT
TCCCGCGGTG CGGAAGTGCT GTTGCTGTCG GACCTGTTGA CTGAGGCACT ACATCACAGC
GGGGCCGCCC GCATGCAGGG GATCGCCGCT GCCGTCGACG CACCGCGGCT GGGACTGCCG
CTGGCGCAAG AGCTTTCGGC CTACCTGCGT AGTCTCGACC CAGGCAGGTT GGCGCATGTG
CTGACGGCCG GCATGACCTT CAACGAGCTC CCGTCGGACA CGCGGACCGA CGTGTCGTTG
GTGTTGCGTA TGCACCATGG CGGAGACTTC GTCATTGAGC CGTTGCCGAA CCTGGTGTTC
ACCCGCGACT CGTCGATATG GATCGGGCCG CGGGTGGTGA TCCCGTCGCT GGCATTACGG
GCACGGGTGC GCGAAGCGTC GCTGACCGAC CTCATCTATG CTCATCACCC GCGGTTCACC
GGTGTGCGGC GTGCCTATGA ATCGCGCACC GCTCCGGTCG AGGGTGGCGA CGTGTTGTTG
CTCGCCCCGG GTGTGGTCGC TGTCGGAGTG GGCGAGCGGA CTACACCAGC AGGCGCGGAA
GCATTGGCGC GCAGCCTTTT TGACGATGAT CTTGCGCATA CCGTGCTCGC CGTGCCGATC
GCTCAGCAGC GCGCGCAAAT GCATCTGGAC ACGGTGTGCA CGATGGTCGA CACCGATACG
ACGGTGATGT ACGCCAACGT TGTCGACACG CTCGAGGCGT TCACGATCCA GCGCACACCC
GACGGCGTGA CCATCGGCGA TGCGGCCCCG TTCGCGGAGG CGGCTGCCAA GGCGATGGGA
ATCGACAAGC TGCGGGTAAT TCATACCGGA ATGGACCCCG TCGTCGCTGA ACGCGAACAG
TGGGACGACG GCAACAACAC GTTGGCGTTG GCGCCCGGTG TCGTTGTCGC CTACGAGCGC
AACGTACAGA CCAACGCCCG CCTGCAGGAC GCGGGCATCG AAGTGCTTAC CATCGCCGGC
TCCGAATTGG GTACCGGCCG TGGCGGGCCC CGCTGCATGT CCTGTCCGGC CGCCCGCGAT
CCGCTTTAG
 
Protein sequence
MGVELGSNSE VGALRVVILH RPGAELRRLT PRNTDQLLFD GLPWVSRAQD EHDEFAELLA 
SRGAEVLLLS DLLTEALHHS GAARMQGIAA AVDAPRLGLP LAQELSAYLR SLDPGRLAHV
LTAGMTFNEL PSDTRTDVSL VLRMHHGGDF VIEPLPNLVF TRDSSIWIGP RVVIPSLALR
ARVREASLTD LIYAHHPRFT GVRRAYESRT APVEGGDVLL LAPGVVAVGV GERTTPAGAE
ALARSLFDDD LAHTVLAVPI AQQRAQMHLD TVCTMVDTDT TVMYANVVDT LEAFTIQRTP
DGVTIGDAAP FAEAAAKAMG IDKLRVIHTG MDPVVAEREQ WDDGNNTLAL APGVVVAYER
NVQTNARLQD AGIEVLTIAG SELGTGRGGP RCMSCPAARD PL