Gene Mvan_5220 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_5220 
Symbol 
ID4644321 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp5589997 
End bp5590989 
Gene Length993 bp 
Protein Length330 aa 
Translation table11 
GC content67% 
IMG OID639808695 
Producthypothetical protein 
Protein accessionYP_955997 
Protein GI120406168 
COG category[R] General function prediction only 
COG ID[COG1545] Predicted nucleic-acid-binding protein containing a Zn-ribbon 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.146278 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.27671 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCACCA GCCAAAGCAG CCCGGTGCAG ATCGATCCCC ATGAGCCGCC GCTTTCCGCG 
CCGCTGAAGC TCGCATTCGA CTACACCCGT TCAGTAGGAC CGCTCCTCGG TGAGTTCTTC
ACCGCCCTGA GGGAGCGGCG CATCGTCGGA GTTCGTGGAT CGGACGGCAA GGTACATGTC
CCGCCCGCCG AGTACGACCC CGTCACCTGG GAGCAACTGA GCGAGATCGT ACCGGTGGCC
AGTGTCGGCA CCGTGCAGTC GTGGACGTGG CAACCCGAAC CGCTCGAGGG ACAGCCGCTG
GACCGTCCGT TCGCCTGGGC GCTGATCAAG CTCGACGGCG CAGACACCCC GCTGCTGCAC
GCGGTCGACG CCGGCTCGTC GGACGCCATC AGCACCGGCA CGAGGGTGCA CGCGCACTGG
GTGGACGAAC CCGTCGGCGC GGTCACCGAC ATCGCCTATT TCGCCCTCGG CGACCAGCCC
GAGGATGTCC CTCCGGCGCC CGAAGGCCTC GATCCGGTGA CGATGATCGT GGTGCCCACG
TCGATCGAGA TCCAGCACAC CGCATCACGT CCGGAGAGCG CGTTCCTGCG CGCACTGGAG
CAGGGCAAGC TGCTCGGCAA CCGCACGGGC GCCGACGGAA AGGTGTACTT CCCTGCCCGC
GAGGCGGATC CGGCCACGGG TGTGCAGCTC GACGAGTACG TCGAGCTGTC CGACAAGGGC
ACCGTCACAA CCTTCGCGAT CATCAACATC CCGTTCGCCG GGCAGCGCAT CAAGCCGCCC
TACGTCGCGG CGTACGTGCT GCTCGACGGC GCCGACATCC CGGTGCTGCA CCTGGTGTCC
GACATCGACG CCGACAAGGT CCGGATGGGC ATGCGTGTGC AGGCGGTGTG GAAGCCCGAG
GACCAGTGGG GTCTGGGCAT CGACAACATC GAGTACTTCC GGCCGACGGG CGAACCCGAC
GCCGACTACG ACACCTACAA GCATCACCTC TGA
 
Protein sequence
MTTSQSSPVQ IDPHEPPLSA PLKLAFDYTR SVGPLLGEFF TALRERRIVG VRGSDGKVHV 
PPAEYDPVTW EQLSEIVPVA SVGTVQSWTW QPEPLEGQPL DRPFAWALIK LDGADTPLLH
AVDAGSSDAI STGTRVHAHW VDEPVGAVTD IAYFALGDQP EDVPPAPEGL DPVTMIVVPT
SIEIQHTASR PESAFLRALE QGKLLGNRTG ADGKVYFPAR EADPATGVQL DEYVELSDKG
TVTTFAIINI PFAGQRIKPP YVAAYVLLDG ADIPVLHLVS DIDADKVRMG MRVQAVWKPE
DQWGLGIDNI EYFRPTGEPD ADYDTYKHHL