Gene Mvan_3851 details

Gene Information

Locus tag: Mvan_3851
Symbol:
ID: 4649270
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium vanbaalenii PYR-1
Kingdom: Bacteria
Replicon accession: NC_008726
Strand:
Start bp: 4114752
End bp: 4116293
Gene Length: 1542 bp
Protein Length: 513 aa
Translation table: 11
GC content: 68%
IMG OID: 639807317
Product: hypothetical protein
Protein accession: YP_954638
Protein GI: 120404809
COG category:
COG ID:
TIGRFAM ID:
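
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming inclusive 1-based coordinates (the record does not state the convention) and that the final codon of the CDS is a stop codon. The function names are hypothetical and chosen only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming its last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(4114752, 4116293)
print(length, protein_length(length))  # prints: 1542 513
```

Under these assumptions the computed values match the listed gene length (1542 bp) and protein length (513 aa).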


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.116307
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value: 0.432378
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGTTGTT CGCGGGAGGA CGTGTTGGGG GCTTTCGATG CCCTCGATAC GGTCGTGGAG 
TCGATCCTGG CGTTGGACTA CGACGCGCTC AGTGCGGCCG AGCGGGTGGG TCTGGATGCC
CGGCTGGAAC GCAACCTGCG CCGGCTGCCG GTGGCCGAAC AGGCGTTGAT CGCCTCGGTG
ATCGCCGAGA CTGAACCGGC CCGTCTCGGT GAGGGCAGCT GGAAAAAGGT GCTGATCACG
GCGTTGCGGG TCAGTGGTGC CGAGGCGGGC CGACGGTTGA GGCGGGCCAA GGCCCTGGGC
CCGCGGCGCG GGTTGACCGG GACGCCGTTG CCGCCGTTGT GGGAGTCCAC TGCTGCTGCG
CAGGCCCAGG GCCTGCTTGG TGAGGAGCAC GTGGCGATCA TCGCGAAGTT CCACAAAGAC
CTGCCGGCCT GGGTCGATGT CGACACCCGC GCCCATGCGG ATCGGCAGCT GGCCCGCAAG
GGCGCCGGAC TCGGGCCCGA GGAACTCGAC GAGGCAGCGG GGCGGTTGAT GATGATGATC
GACCAGGACG GCCCCGAACC CTGCGACAAA GAGCGGGCCC GCAAACGCGG TGTCCGGATC
AGCAAGCAAC ATTCCGACGG CACCGCCACC ATCTCGGGCA CCCTGACCCC CGAAGCTCTG
GCCGTCTGGC AGGCGATCTT CGCCAAAGAA GCCGCCCCCG GAGCCAACCT GCCCGAGTCT
GAACACACCG AGGACAGCAC ATCCGGCGGC ACGCCGCGCG ACACCGAAAC CTCGGACCAC
GCCGACGCAT CGGGTGACGG CGTGGCCCCC AGCAGCACGG CGGGTCACGG CAACACTTCG
GGTGATGATG GTGGCGACGC GCCGGCTGAG GACCACGATC CACAGCCCGA ACGGTGCGGC
TCTGATACCC GTACCCAGGC TCAGCGCAAC CACGACGCCT TCCTGGCCGT CGGACGCCGC
CTGCTGGAAT CCGGAGAACT GGGCACCCAC AACGGGTTAC CGGTGACGGT GATCGTCTCC
ACGACGCTGC AGGAGCTCGA AAAAGGCGCC GGGGTCGCGG TCACCGGCGG CGGATCGCTG
TTGCCGATGC CCGACCTGAT CCGGCTGGCC GCCCGAGCCC ACCACTACCT CTACGTCTAC
GACCAACACA GCGGCCAATC CCTCTACCTG GGCCGGGCCA AACGGTTGGC CAACGCCGCG
CAGCGGATCG TGCTGCACGC CCGCGACCGC GGGTGTACGC GACCGGGCTG CACCGCACCC
GGGTACTGGT GCCAGGCCCA CCACGCCAGC GCCGATTTCG TCAACGGCGG ACTGACCAAC
ATCGACGACC TGACCCTGGC ATGCCCGTGC GATCACCGCA TGCTCGACAA CACCGGCTGG
CGCACCCGCA AAAACGGCAA AAACCAGACC GAATGGCTCC CACCACCAGA CCTCGACACA
GGCCAACACC GCGTCAACGG CCACCACCAC CCCGAACGCC ACCTACTCCC CGAAGACAAC
CTCCCCGAAG ACGACCTCCC CGAGGACGAC CAAGGCCCGT AG
 
Protein sequence
MGCSREDVLG AFDALDTVVE SILALDYDAL SAAERVGLDA RLERNLRRLP VAEQALIASV 
IAETEPARLG EGSWKKVLIT ALRVSGAEAG RRLRRAKALG PRRGLTGTPL PPLWESTAAA
QAQGLLGEEH VAIIAKFHKD LPAWVDVDTR AHADRQLARK GAGLGPEELD EAAGRLMMMI
DQDGPEPCDK ERARKRGVRI SKQHSDGTAT ISGTLTPEAL AVWQAIFAKE AAPGANLPES
EHTEDSTSGG TPRDTETSDH ADASGDGVAP SSTAGHGNTS GDDGGDAPAE DHDPQPERCG
SDTRTQAQRN HDAFLAVGRR LLESGELGTH NGLPVTVIVS TTLQELEKGA GVAVTGGGSL
LPMPDLIRLA ARAHHYLYVY DQHSGQSLYL GRAKRLANAA QRIVLHARDR GCTRPGCTAP
GYWCQAHHAS ADFVNGGLTN IDDLTLACPC DHRMLDNTGW RTRKNGKNQT EWLPPPDLDT
GQHRVNGHHH PERHLLPEDN LPEDDLPEDD QGP
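
The protein sequence is the translation of the gene sequence under translation table 11. The sketch below shows how the two relate. It assumes the codon assignments of NCBI translation table 11, which are the same as the standard genetic code (table 11 differs mainly in which start codons are permitted). It is a hand-rolled illustration rather than a library call, and `clean`, `translate`, and `gc_percent` are hypothetical helper names. Only the first three 10-base blocks of the gene sequence are used as input.

```python
from itertools import product

# Codons in NCBI order (T, C, A, G at each position) paired with amino acids.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to display 10-base blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]  # raises KeyError on ambiguous bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


first_blocks = "ATGGGTTGTT CGCGGGAGGA CGTGTTGGGG"
print(translate(first_blocks))  # prints: MGCSREDVLG
```

The output matches the first block of the listed protein sequence. Applied to the full 1542 bp gene sequence, `translate` would be expected to reproduce the 513-residue protein, and `gc_percent` a value close to the listed 68%.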