Gene Mvan_1940 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_1940 
Symbol 
ID4648177 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp2075004 
End bp2076986 
Gene Length1983 bp 
Protein Length660 aa 
Translation table11 
GC content68% 
IMG OID639805427 
Producthypothetical protein 
Protein accessionYP_952766 
Protein GI120402937 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCGGCGTT ACCGTACGGG CGGGGGTCAG CACGGAAGGA CGCCGCTCTT GACGACGACG 
CCGCAGCATC CGGTCAGCCC GTTGTCCCGC CACGGGCGGT TCGTCGGCCG CGTCGGAGGG
CTCGCGGTGG CGCTAGGGAT CGGCATCGCG ATCGCGAACA GCCCCGCCGT CGCGGCTGCC
GACGACGACA CCACGAGCTC GGAGGCATCG TCCGCGTCAA CCGCGGACAG CGACACGATC
TCGAACACCC CGAACAACAC CGACAGCGAC AGCGACAGCG ACAGCGAAGA GTCGGGCGAC
GCCGGGGAGC CGGACCAGGC CGACGAGGAC GATCCGGCAC CTGACGTCGA AGACGAGGAG
CGGCGGACCC GGGTCACCGT CGACGTCCCC TCCCCCGAGT CCGAGGCCGA ACCCGACTCC
AAGTCCGAGT CCGAGCCCGA GACCGAGACC GAACCCGAGG CCGACGCCGA ATCCGTCGAT
CGCGACGACT CCGCCGATGA CCCCGTCAGC CCCGAGCCGG CGGCGGTCTG GGCACTTGCC
GGATCCGCAC GCCGCGAGAC CGCTGTCGAG TCCCCTTCGA TGACGCAGGA CATCCAGGCG
GCGGCCACGG CCAGCCCGCT GGGCACCGAA CAACAACTCG AGGCCGAGCA GATCGCCGCC
GAAACGGTGA AGACCTGGCC GGTGCGGCTG ATGAAGTTCG TGTTGAGCGT GGGCTGGCTG
GCGACAGCGC ATCGCGAGTA CAGCGAGATC AACGGCCCGG ACTGGGACAA CCTCTGGCAG
CTGCACCGGG CCGTCGACGA GTACGCGATG GGCACCGCCT TCCAGCAGCA GCTGCTCAAC
CCGATGACGC CGACCGTGGT CACCCAGGTC GCGCCGCCGC ACAGCTGGTA CGGCCGGGAC
GTCGAGGGCT CCCGCATCCT CTACGACAAC CCCGACACGA TCTACCGCTT CATGGGCGTG
AACATGACCT CCACCTACGT GATCAAGGGC CAGTTCGTCG GCGTGCACCC GGCGGACACC
AGCTTCAGCG TGCTCACCGG ACTGTCCGGC GTCACCGCGG ACTACCTCAG CGGCCGCGAC
ATCGAGATCG CACCGGACGG CTCGTTCACG ATCACCGTCA GCGGCGCGCC CGCCGCGCCG
GGCCAGGCCA ACCACCTGCA GCTGACCGCC GACACCACAC TGATCGCGGT GCGCAACACC
TTGTCGGACT GGACCACGCA GGACCCGATG AGCCTGACCA TCGAACGGTT GTCGGGTCCG
CGGAACAGTC TGTTCAGCCA GCTCGGCGGC TTCGCGATCC CCGGGCTCGG ACCGATGGTG
ACGAAGAGCC CGCTGCTGAC GACGCTGGTG TCGTTGATCC CGCCGATGAA GGAGCCGCCG
CGGATTCTGC GGGGCGCGTT CGCGGCGGTC ATCATGGGGC TCGGCCTGGG GATGGAGTCC
AAGTACATCA AGGTCGCCAC CACCGATCCG GCGACCGGTG ACCGCGTCGC GCCCAACCAC
CTACCCCACC CGTCGCGCAA CGCCGAGTTC CTGGCCACCC AGCTGCAGAG CGCCGGATAC
TTCCAGCTGT GCGACGATCA GGCCCTGGTC GTCACCATCG TGCCCGGCAA TGCGCGCTAC
TTCGTCGTCC CGGTCACCAA CCTGTGGACC GTCACGGGAA ACTATTGGGA CGAACAGACC
AGCCTGAACA ACGCGCAGGC CGTCGCGAAT CCGGACGGCA GCTACACGTT CGTCATCTCA
CCCACCGACA CCGGTGTCCA CAACTGGGTG TCGACCGGCG GACTGAACAA GGGCACGGTG
TCGATCCGCT TTCAGGACCT CGACCTGGCG TCGTCGAAGA CTCCGACGGT GACCTCCGCG
GTGGTGCCGG TGTCGGACCT GGCGGCGGTC CTTCCGCCGA CGACGGCATA CGTGACCGCC
GCCGAACGCC AGAGCCAACT CAGCGTCCGT AGGGCGGGCT TCGACCGTCG CTTCGCGGAT
TGA
 
Protein sequence
MRRYRTGGGQ HGRTPLLTTT PQHPVSPLSR HGRFVGRVGG LAVALGIGIA IANSPAVAAA 
DDDTTSSEAS SASTADSDTI SNTPNNTDSD SDSDSEESGD AGEPDQADED DPAPDVEDEE
RRTRVTVDVP SPESEAEPDS KSESEPETET EPEADAESVD RDDSADDPVS PEPAAVWALA
GSARRETAVE SPSMTQDIQA AATASPLGTE QQLEAEQIAA ETVKTWPVRL MKFVLSVGWL
ATAHREYSEI NGPDWDNLWQ LHRAVDEYAM GTAFQQQLLN PMTPTVVTQV APPHSWYGRD
VEGSRILYDN PDTIYRFMGV NMTSTYVIKG QFVGVHPADT SFSVLTGLSG VTADYLSGRD
IEIAPDGSFT ITVSGAPAAP GQANHLQLTA DTTLIAVRNT LSDWTTQDPM SLTIERLSGP
RNSLFSQLGG FAIPGLGPMV TKSPLLTTLV SLIPPMKEPP RILRGAFAAV IMGLGLGMES
KYIKVATTDP ATGDRVAPNH LPHPSRNAEF LATQLQSAGY FQLCDDQALV VTIVPGNARY
FVVPVTNLWT VTGNYWDEQT SLNNAQAVAN PDGSYTFVIS PTDTGVHNWV STGGLNKGTV
SIRFQDLDLA SSKTPTVTSA VVPVSDLAAV LPPTTAYVTA AERQSQLSVR RAGFDRRFAD