Gene Mvan_3103 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_3103 
Symbol 
ID4646859 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp3275146 
End bp3276387 
Gene Length1242 bp 
Protein Length413 aa 
Translation table11 
GC content65% 
IMG OID639806580 
Producthypothetical protein 
Protein accessionYP_953911 
Protein GI120404082 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.0463748 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.241935 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTAAAG GCTTACGAAA GGCCGGGGTT GCCTGTGGCA CCGCCTTTGC TTCAGTTGCG 
ATAGCGCTGT CAACCGCAAC GCCAGGATTG GCCACCCCGT CCGGGCTCGT GGTGGGTGGC
TTGGCAACGC CGACGATGCA CGACCTGGTG ATGGCACAGA TCCTCAAGGA CGAGCTGCAC
GGGATGGAGC GTGTCAGCAT CTACTGGCCC GCCGAGGCGA AGCCGTACAG CGGCGACACC
TACACCCTCG GTGAGTCGAT CGCCATCGGC ATCACCAACC TCAACACCGA GATCGACACC
GCGATCGGCA AGCTCGCCCC CGGCGAGCAG GTCAAGATCG TCGGTCTGTC GGCCGGCTCA
CTGGTCGTCA CGGAGGTGCT CCGCCTGCTC GCGGCAGACC CCGACGCGCC CGCGGCCGAT
CAGCTGACGT TCATCGTGGT CGCCGACTCC AGCCGGCAGA AGCTGATCGA CAAGGCCCGT
TACAACAGCC GCTACGACTA CACCTACCAG CCGCCACCGG AGACGAAGTA CGACGTCAAG
GAGGTCACCG GTGAGTACGA CGGCATGGCA GACTTCCCGG ATCGCTGGTG GAACTTCGTG
GCGGTCGCCA ACGCGATGGC CGGCGGCATC TTCGTGCACA TACCGATGAT GTTCGCTGAC
CTCAAGGACG AGTACATCAC CGAAATCGAC GTCAACGAGC TGGGCGGAAC GACGACGAAG
TACCTGATTC CCACCGCGAA ACTCCCTCTG GTTCAACTCC TCCCGTTCCT CGCGCCGATG
GAGGCCGAAC TCAAGGAGAT GGTTGACCGG GGTTACAGCC GCAACGACAT CGTCGAGACC
AGCGCGCTGC GTACGCTCAC CGCGGCGGTC GAAGAGACCG ACGACGCCGC GGAGGTCACC
GCGCCCGCCG AGGAATCGGT CGAGGTTGAC GATGCCGACC TCAAGACAGA GGCAAAGGGC
GACGACGGCG AGGATCTCGG CGCCGTCGGC GAGGACGCCG ACACAGGCTC CAGCACGGGC
GACGACGAAG CTGAGGTCAT CGACGAGGCT GATGAAGTCG ACGAGCCCGA CGCCATCGAT
GAGGCCGTCG AGGACGCCGA CGAGGCCGAC TCGGTCGACG CCGGACAGGG CGACGAGGAC
GCCGCCGAAT CCGAGGACTC CACGGATACC GACGACTCCG ACTCGTCGGG AGACGCGGCG
TCCGACGGAA CTGGGTCGTC CGAATCCGGC TCCTCGGAGT AG
 
Protein sequence
MRKGLRKAGV ACGTAFASVA IALSTATPGL ATPSGLVVGG LATPTMHDLV MAQILKDELH 
GMERVSIYWP AEAKPYSGDT YTLGESIAIG ITNLNTEIDT AIGKLAPGEQ VKIVGLSAGS
LVVTEVLRLL AADPDAPAAD QLTFIVVADS SRQKLIDKAR YNSRYDYTYQ PPPETKYDVK
EVTGEYDGMA DFPDRWWNFV AVANAMAGGI FVHIPMMFAD LKDEYITEID VNELGGTTTK
YLIPTAKLPL VQLLPFLAPM EAELKEMVDR GYSRNDIVET SALRTLTAAV EETDDAAEVT
APAEESVEVD DADLKTEAKG DDGEDLGAVG EDADTGSSTG DDEAEVIDEA DEVDEPDAID
EAVEDADEAD SVDAGQGDED AAESEDSTDT DDSDSSGDAA SDGTGSSESG SSE