Gene Mvan_4102 details


Gene Information

Locus tag: Mvan_4102
Symbol:
ID: 4648711
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium vanbaalenii PYR-1
Kingdom: Bacteria
Replicon accession: NC_008726
Strand:
Start bp: 4394462
End bp: 4395682
Gene Length: 1221 bp
Protein Length: 406 aa
Translation table: 11
GC content: 65%
IMG OID: 639807568
Product: ErfK/YbiS/YcfS/YnhG family protein
Protein accession: YP_954885
Protein GI: 120405056
COG category: [S] Function unknown
COG ID: [COG1376] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
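
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the record above.
start_bp = 4394462
end_bp = 4395682
gene_length_bp = 1221
protein_length_aa = 406


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons minus one terminal stop codon.

    Assumes an unspliced, complete CDS (the record lists the gene as not spliced).
    """
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


coords_ok = span_length(start_bp, end_bp) == gene_length_bp
protein_ok = expected_protein_length(gene_length_bp) == protein_length_aa
print(coords_ok, protein_ok)  # prints "True True"
```

Under these assumptions, the listed gene length (1221 bp) and protein length (406 aa) agree with the listed coordinates.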


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00342341
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTGGCAGG TCAACACGGC AACCCGGGCG CGGCGGAGGC GGTCTTGGCT GACGGTCGCT 
GTGCTGGTTC CCGCGATTGC GCTCGGCCTG ACCGCCTGCG GCGACAACGC AGAACCGCCC
CAGGCCCAGG TGATCACCGA CAAGGGCACA CCGTTCGGTG ACCTGCTGGT CCCGAAGCTG
ACGGCCTCGG TTCAGGACGG CGCCGTCGGC GTCAGTGTCG ACTCGCCGGT CACCGTCAGC
GCCGATTACG GTGTGCTCGG CGCCGTGACG ATGGTCAACG AGGACGGCGA GCCCGTCGCG
GGGCAGCTGT CCGAGGACGG GCTGACGTGG GAGACCGCGG AACCGCTCGG CTACAACAAG
AGCTACACGC TGACTGCACA GTCACTCGGT CTCGGCGGTG TGACCAGCAG TCAGATGACG
TTCGAGACGC ACTCGCCGGA AAACCTGACG ATGCCCTACG TGCTGCCCAA CGACGGTGAG
GTCGTCGGTG TCGGGCAGCC GGTGGCGATC CAGTTCGACG AGAACATCCC GAATCGCCTC
GCCGCGCAAC GCGCGATCAC CGTCAAGACC ACTCCGCCCG TCGAGGGCGC GTTCTACTGG
CTCAACAATC GCGAAGTGCG TTGGCGCCCA GCCAAGTACT GGAAACCCGG AACGACGGTC
GAGGTGGCGG TCAACACCTA CGGAGTGGAT CTGGGCGACG GTCTGTTCGG TCAGGACAAC
GTCAAGACGA GCTTCAAGAT CGGTGACGAG GTCATCACGA CCGTCGACGA CAACACCAAG
ACGCTGACCG TGCGCCGCAA CGGCGAGGTC ATCAAGAGCA TGCCCGTCTC GATGGGTAAG
AACAGCACCC CGACCAACAA CGGGGTGTAC ATCGTCGGGG ACCGGCGGTC GCACATGGTG
ATGGACTCGT CGACATACGG CGTTCCGGCC AACTCGCCCA ACGGGTACCG CACCGAGGTC
GACTGGGCCA CCCAGATCTC CTACAGCGGC ATCTATGTGC ACGCCGCCCC GTGGTCGGTG
GGCAGCCAGG GCTACAGCAA TGTCAGCCAC GGCTGCATCA ACGTGAGCAC CAGTAACGGC
CAGTGGTTCT ACGACAACTC CAAGCGCGGC GACATCGTGG AGATCGTCAA CACCGTGGGA
TCGCCGTTGT CGGGCACCGA CGGGCTGGGC GACTGGAACA TCCCGTGGGA TCAGTGGAAG
GCCGGCAACG CCAACCTCTG A
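
The gene sequence above is printed in space-separated blocks of ten bases, so it needs cleaning before its length or GC content can be compared with the record fields. The following is a rough sketch of that step; the helper names are hypothetical, and only the first and last lines of the sequence are used here as an illustration.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a formatted sequence block."""
    return "".join(block.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First and last lines of the gene sequence above (illustration only;
# paste the full block to check the record's length and GC content).
first_line = "ATGTGGCAGG TCAACACGGC AACCCGGGCG CGGCGGAGGC GGTCTTGGCT GACGGTCGCT"
last_line = "GCCGGCAACG CCAACCTCTG A"

print(len(clean_sequence(first_line)), len(clean_sequence(last_line)))
print(gc_percent("ATGC"))  # toy input: half of the bases are G or C
```

Applying `clean_sequence` to the full block and then `len` and `gc_percent` to the result gives values that can be compared with the Gene Length and GC content fields.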
 
Protein sequence
MWQVNTATRA RRRRSWLTVA VLVPAIALGL TACGDNAEPP QAQVITDKGT PFGDLLVPKL 
TASVQDGAVG VSVDSPVTVS ADYGVLGAVT MVNEDGEPVA GQLSEDGLTW ETAEPLGYNK
SYTLTAQSLG LGGVTSSQMT FETHSPENLT MPYVLPNDGE VVGVGQPVAI QFDENIPNRL
AAQRAITVKT TPPVEGAFYW LNNREVRWRP AKYWKPGTTV EVAVNTYGVD LGDGLFGQDN
VKTSFKIGDE VITTVDDNTK TLTVRRNGEV IKSMPVSMGK NSTPTNNGVY IVGDRRSHMV
MDSSTYGVPA NSPNGYRTEV DWATQISYSG IYVHAAPWSV GSQGYSNVSH GCINVSTSNG
QWFYDNSKRG DIVEIVNTVG SPLSGTDGLG DWNIPWDQWK AGNANL