Gene Mvan_3200 details


Gene Information

Locus tag: Mvan_3200
Symbol:
ID: 4648478
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium vanbaalenii PYR-1
Kingdom: Bacteria
Replicon accession: NC_008726
Strand:
Start bp: 3403998
End bp: 3406355
Gene Length: 2358 bp
Protein Length: 785 aa
Translation table: 11
GC content: 70%
IMG OID: 639806677
Product: RNA-binding S1 domain-containing protein
Protein accession: YP_954008
Protein GI: 120404179
COG category: [K] Transcription
COG ID: [COG2183] Transcriptional accessory protein
TIGRFAM ID: [TIGR00426] competence protein ComEA helix-hairpin-helix repeat region
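
The coordinate fields in this record are consistent with inclusive, 1-based positions: the gene length equals End bp minus Start bp plus one, and the protein length equals the codon count minus one for the stop codon. A minimal Python sketch of that arithmetic follows. The function names are hypothetical and not part of any database API, and the inclusive-coordinate convention is an assumption inferred from the values listed here.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive, 1-based coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(3403998, 3406355)
print(length, protein_length_aa(length))
```

With the Start bp and End bp of this record, the two results can be compared against the Gene Length and Protein Length fields.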


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0648133
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value: 0.853857
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGGCGGCC ACTCGAAATC GAGTAGGGTT CCGCGCGTGA TCAAGTCTGT AAACGTCCGT 
CTCGCTGAGG AACTGGAGGT GGCAGAGACG CAGGTGGCTG CGGCTGTGCG CCTGCTCGAC
GAAGGTGCCA CGGTGCCTTT CATCGCCCGC TACCGCAAGG AGGCCACCGG CAGCCTTGAC
GACGGCCAAC TTCGGGTTCT GGAGGAACGG CTGGCGTACC TTCGCGAGCT CGACGAGCGC
CGCAGTGCGG TGCTTGCCTC GATCAAGGAA CAGGGCAAGC TCACCGACGA TCTGACCGCC
GCGCTGATGG CCGCCGACAC CAAGGCCCGC GTCGAGGACA TCTACCTGCC CTTCAAACCC
AAGCGGCGGA CCAAGGCGCA GATCGCCAGG GAAGCCGGGC TGGAGCCGCT GGCAGACCGA
TTGCTGGCCG ATCCGACACT GGTGCCCGAG CAGACGGCAG CGGAGTTCGT CGGCGAGGAG
GTGGCCGACG TCACGGCGGC ACTCGAGGGC GCACGGCACA TCATCGTCGA ACGCGCCGCC
GAGGACGCCG AGCTCGTCGG CGGGTTGCGG GAACGGTTCT GGGAGTCAGG AACGCTGCGC
ACCCGGCCGG CCTCCGAGGC CGCCGCGAGC GCGGAGAAGT CGCAGAAGTT CCGCGACTAC
TTCGACTACG CGGAACCGCT GACGGAGATG CCGTCGCACC GGGTGCTCGC GGTGCTGCGC
GGCGAGAAGG AACAGGCGCT GGCGCTGACC CTGGACGGGG GTGAGGAGGA CTCCTACCTG
GCGATGATCG CCTCCGCGCT GGGGATCGAC CTGACCGCGG CCGCACCGGC CACCCGATGG
CTGGCGTCCA CCGTCGGCTT CGCGTGGCGC ACCCGGCTCT CGGTGTCGGC GTCGGTGGAT
GCGCGGGTGC GGCTGCGCCT GCGCGCCGAG CAGGACGCCG TGACCGTGTT CGCCAAGAAC
CTCAAGGACC TGCTGCTGGC GGCACCGGCG GGCAACCGGA CCACGATGGG CCTGGATCCC
GGCTTCCGGA CCGGCGTCAA GGTTGCCGTC GTCGACGGCA CCGGCAAGGT GCTCGACACC
TGCGCGATCT ACCCGCACCA GCCGCAGAAG CAGTGGGATG CGGCCAAGGC GACGCTGGCT
GCGCTCGTCG CGCGGCACGG CGTCGAGCTG ATCGCGATCG GCAACGGCAC CGCGTCGCGT
GAAACAGACG CGCTGGCAAC CGAACTCATC GCGGACATCC GCACGGCCGG GGCGAACGCG
CCGGCCAAGG CGATCGTCAG CGAGGCCGGC GCTTCGGTGT ACTCGGCGTC CGCCTACGCC
GCCCACGAAC TGCCCGACCT CGACGTCACG CTGCGCGGCG CGGTGTCCAT CGCCCGCCGC
CTGCAGGACC CGCTCGCCGA GCTCGTCAAG ATCGAGCCGA AGTCCATCGG TGTGGGGCAG
TACCAGCACG ACGTCACCCC GGGCACCCTG GCCAGGAGCC TGGGAGCGGT GGTCGAGGAT
GCGGTGAACG CGGTCGGCGT GGACCTCAAC ACGGCGTCGG TCCCGCTGCT GGCGCGGGTC
TCCGGGATCA CCGAGTCGCT GGCCGAGGCG ATCGTCGCCC ACCGCGACAA GACCGGCCGC
TTCCAGAACC GTCGCGCGCT GCTCGACGTT CCGCGGTTGG GCCCCAAGGC TTTCGAACAG
TGCGCCGGGT TCCTGCGGAT CCGTGACGGC GAAGATCCGC TCGACGCCTC CGGCGTGCAT
CCCGAGTCCT ACCCGGTGGT GCGGCGCATC CTCGACCGCG CCAACGTCAC GCTGGCCGAG
ATCATCGGTA ACGAGCGCAC GCTGCGCGCG CTGCGTCCCG CAGACTTCGC CGACGACCGG
TTCGGTATCC CGACCGTCAC CGACATCCTC GGTGAGCTCG AAAAGCCCGG CCGGGACCCG
CGCCCGGCCT TCACCACCGC GACGTTCGCC GCGGGCGTGG AAAAGGTGGC CGACCTCAAG
GTCGGGATGA TCCTCGAGGG TGTGGTGACC AATGTGGCGG CCTTCGGCGC GTTCGTCGAC
GTGGGGGTGC ACCAGGACGG TCTGGTGCAC GTCTCGGCGA TGGCCGACCG CTACATCTCC
GATCCCCACG AGGTGGTGCG GTCCGGGCAG GTGGTGCGGG TGAAGGTGGT CGACGTCGAC
GTCGACCGGC AGCGCATCGG GTTGAGCCTG CGCCTCAAGG ACGACGTGAA GCCCGAGCGC
GGCGGCGGCC GGCGAGGTGA CCGGCCCGCC AATCCTAAAC GAAATCCGCA GCGCGCCAAC
AACTCCGGTC GCCGGGAAGC CGGCAGCGGT GGCGGGTCGA TGGCCCAGGC GCTGCGGGAA
GCGGGTTTCG GTCGATGA
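
The GC content field is the fraction of G and C bases in the gene sequence. A minimal Python sketch of that calculation follows. The function name `gc_content` is hypothetical, the code assumes the sequence contains only A, C, G and T, and it ignores the whitespace that separates the 10-base groups. How the source database rounds the reported percentage is not stated.

```python
def gc_content(dna: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace and case."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First two 10-base groups of the gene sequence, used only as a small example.
print(gc_content("GTGGGCGGCC ACTCGAAATC"))
```

Passing the full gene sequence instead of the two-group example gives a value that can be compared against the GC content field of this record.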
 
Protein sequence
MGGHSKSSRV PRVIKSVNVR LAEELEVAET QVAAAVRLLD EGATVPFIAR YRKEATGSLD 
DGQLRVLEER LAYLRELDER RSAVLASIKE QGKLTDDLTA ALMAADTKAR VEDIYLPFKP
KRRTKAQIAR EAGLEPLADR LLADPTLVPE QTAAEFVGEE VADVTAALEG ARHIIVERAA
EDAELVGGLR ERFWESGTLR TRPASEAAAS AEKSQKFRDY FDYAEPLTEM PSHRVLAVLR
GEKEQALALT LDGGEEDSYL AMIASALGID LTAAAPATRW LASTVGFAWR TRLSVSASVD
ARVRLRLRAE QDAVTVFAKN LKDLLLAAPA GNRTTMGLDP GFRTGVKVAV VDGTGKVLDT
CAIYPHQPQK QWDAAKATLA ALVARHGVEL IAIGNGTASR ETDALATELI ADIRTAGANA
PAKAIVSEAG ASVYSASAYA AHELPDLDVT LRGAVSIARR LQDPLAELVK IEPKSIGVGQ
YQHDVTPGTL ARSLGAVVED AVNAVGVDLN TASVPLLARV SGITESLAEA IVAHRDKTGR
FQNRRALLDV PRLGPKAFEQ CAGFLRIRDG EDPLDASGVH PESYPVVRRI LDRANVTLAE
IIGNERTLRA LRPADFADDR FGIPTVTDIL GELEKPGRDP RPAFTTATFA AGVEKVADLK
VGMILEGVVT NVAAFGAFVD VGVHQDGLVH VSAMADRYIS DPHEVVRSGQ VVRVKVVDVD
VDRQRIGLSL RLKDDVKPER GGGRRGDRPA NPKRNPQRAN NSGRREAGSG GGSMAQALRE
AGFGR
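
The protein sequence corresponds to the gene sequence read codon by codon under translation table 11, with the leading GTG read as methionine because it acts as the start codon. A minimal Python sketch of that translation follows. The names `translate` and `START_CODONS` are hypothetical; the codon assignments are those of the standard code, which table 11 shares, and `START_CODONS` lists only three common bacterial start codons, which is a simplification of the table's full set of initiation codons.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Simplified set of start codons (assumption; table 11 allows a few more).
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna: str, first_is_start: bool = True) -> str:
    """Translate a coding sequence until the first in-frame stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        # A start codon at position 0 is read as methionine.
        if pos == 0 and first_is_start and codon in START_CODONS:
            amino_acid = "M"
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("GTGGGCGGCC ACTCGAAATC GAGTAGGGTT"))
```

Translating the first 30 bases gives the first ten residues of the protein sequence (MGGHSKSSRV), and translating the final gene sequence line with `first_is_start=False` gives its last five residues (AGFGR).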