Gene Mvan_3620 details


Gene Information

Locus tag: Mvan_3620
Symbol:
ID: 4647186
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium vanbaalenii PYR-1
Kingdom: Bacteria
Replicon accession: NC_008726
Strand:
Start bp: 3852226
End bp: 3853350
Gene Length: 1125 bp
Protein Length: 374 aa
Translation table: 11
GC content: 72%
IMG OID: 639807093
Product: hypothetical protein
Protein accession: YP_954417
Protein GI: 120404588
COG category: [S] Function unknown
COG ID: [COG3323] Uncharacterized protein conserved in bacteria
TIGRFAM ID: [TIGR00486] dinuclear metal center protein, YbgI/SA1388 family
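
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not state this, but it matches the listed gene length) and that the CDS ends in a single stop codon. The function names are illustrative and not part of any database API.

```python
# Values copied from the record above.
START_BP = 3852226
END_BP = 3853350


def gene_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumption)."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1125
print(protein_length(length_bp))  # 374
```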


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.381663
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGCAACTG TCCGGCTGGT TGACGTCATC GACGTGCTCG AAGCCGCCTA TCCGCCCGGT 
CTGGCGCAGA GCTGGGATTC CGTCGGGCTG GTCTGCGGGG ATCCGACCGA ACCGGTGGAG
TCGGTGACGA TCGCGGTCGA CGCCACCCCG CAGGTGGTGG CTTCGGTACC CGACCGTGCG
CTGCTGCTGG TCCATCACCC GCTGCTGCTG CGCGGGGTGG ACTCGGTGGC GGCGGACACG
GCCAAGGGTG CGCTCATCCA CTCGTTGATC CGGACCGGTC GCGCGCTGTT CACCGCGCAC
ACCAACGCCG ACAGCGCTTC GCCGGGGGTG TCCGACGCCC TCGCCGATGC GCTCGGCCTG
CAGGTCTGCG ACGTGCTCTC GCCGGTGCCG TCGGGTCCGG CGCTGGACAA GTGGGTGGTG
TTCGTCCCGG CCGGGAACGC CGAGGCGGTG CGCTCGGCGA TGTTCGCCGG CGGCGCAGGC
CAGCTGGGCG ACTACTCGCA GTGCAGCTGG AGCGTGTCGG GCACCGGGCA GTTCCTGCCG
GGTGACGGCG CGACGCCGGC GATCGGCTCG GTGGGCACGC TCGAGCAGGT GGCCGAGGAC
AGGGTCGAGA TGGTGGCCCC GGCGTCGCGC AGGGCCGCCG TGCTGGCCGG CCTGCGCGCC
GCCCATCCGT ACGAGGAACC CGCATTCGAC GTCGTCGCAC TGCAGACCCC GCCCGGCGAT
GTCGGGCTGG GACGCATCGC GACGTTGCCC GAACCGGAGC CGCTGCCCGC GTTCGTGTCC
CGGGTCCGAC GCGCGCTGCC CGCCACGTCG TGGGGTGTCC GCGCCTCCGG AGACCCCGCC
ACGACGGTGG CGCGGGTGGC GGTGTGCGGA GGCTCCGGTG ACTCGCTGCT GTCCGAGGTG
GCGGCTTCGG GGGTGCAGGC CTACGTCACC GCCGATCTGC GCCACCACCC CGCCGACGAG
CACCGCCGTA GCTCGGACGT CGCGCTGATC GACGCCGCCC ACTGGGCCAC CGAGTTCCCG
TGGTGCAATC AAGCCGCCGA GTTACTGCGC CACCACTTCG GTGCATCGTT GCCGGTCACG
GTGTCCGATG TGCGTACCGA TCCATGGAAT ATCGAGGGAT GTTGA
 
Protein sequence
MATVRLVDVI DVLEAAYPPG LAQSWDSVGL VCGDPTEPVE SVTIAVDATP QVVASVPDRA 
LLLVHHPLLL RGVDSVAADT AKGALIHSLI RTGRALFTAH TNADSASPGV SDALADALGL
QVCDVLSPVP SGPALDKWVV FVPAGNAEAV RSAMFAGGAG QLGDYSQCSW SVSGTGQFLP
GDGATPAIGS VGTLEQVAED RVEMVAPASR RAAVLAGLRA AHPYEEPAFD VVALQTPPGD
VGLGRIATLP EPEPLPAFVS RVRRALPATS WGVRASGDPA TTVARVAVCG GSGDSLLSEV
AASGVQAYVT ADLRHHPADE HRRSSDVALI DAAHWATEFP WCNQAAELLR HHFGASLPVT
VSDVRTDPWN IEGC
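
The gene sequence starts with GTG, while the protein sequence starts with M. This is expected under translation table 11, where GTG is one of several alternative initiation codons and is read as methionine at the first position. The following is a minimal, dependency-free sketch of that rule, not the tool that produced this record. `translate_cds` and its `first_is_start` parameter are hypothetical names introduced here, and only the first 30 and last 15 nucleotides of the gene sequence above are used as input.

```python
from itertools import product

# NCBI translation table 11, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Initiation codons allowed by table 11; each is read as Met at position 1.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str, first_is_start: bool = True) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-mers
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        if i == 0 and first_is_start and codon in START_CODONS:
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


head = "GTGGCAACTG TCCGGCTGGT TGACGTCATC"  # first 30 nt of the gene sequence
tail = "ATCGAGGGAT GTTGA"                   # last 15 nt, including the stop codon
print(translate_cds(head))                        # MATVRLVDVI
print(translate_cds(tail, first_is_start=False))  # IEGC
```

Passing the full gene sequence to `translate_cds` in the same way should reproduce the protein sequence listed above.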