Gene Mvan_5379 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_5379 
Symbol 
ID4647095 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp5756160 
End bp5757230 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content69% 
IMG OID639808854 
Producthypothetical protein 
Protein accessionYP_956156 
Protein GI120406327 
COG category[S] Function unknown 
COG ID[COG5282] Uncharacterized conserved protein 
TIGRFAM ID[TIGR03624] putative hydrolase 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.00392242 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGCTCGT CGTCGAAGTC CGGTTTCACG GTCGGCCGTG CGGTGGACTG GAAGCTGGCC 
GCCACGTTGG GCGGCAAACT GGCGCGTCCG GAGCCGCCGG CCACCGACTA CACGCGCAAG
CAGGCGTTCG AGCAGCTCGC CGAGGCGGCC CGGGCCTCCG AGCTGCCGGT GCGGGAGGTG
ACCGGGCTGA TCGAGGGCGG TGAGATCCCC GAGGCGCGAA TCGTCAACCG GCCCGAGTGG
ATCCACGCCG CCGCGCAGTC GATGCGGGCG ATGACGGGCG GCGGGCACGC AGACGACGTC
AAGCCGCGTG CCGTCACCGG TCGTATCGCC GGTGCGCAGA CCGGGGCCGT GCTTGCGTTC
GTCTCATCGG GGATCCTCGG CCAGTACGAC CCGTTCGCCG TGGGGGGCGG AGAGCTGCTC
CTGGTGTACC CGAACGTGAT CGCCGTCGAG CGGCAGCTTC GGGTGGCGCC CAAGGACTTC
CGGATGTGGG TGTGTCTGCA CGAGGTCACC CACCGTGTGC AGTTCCGGGC CAACCCCTGG
CTGGCCGACC ACATGTCGAA GGCGCTCGCG GTGCTGACCG AGGACGCCGG GGAAGACCTG
CCCCAGGTGG TCGGCCGGCT CGTCGACTAC GTCCGTGACC GCGAGGTGGT GGTGAAAAAC
TCTGAGCCGG CGATGAATTC GACCGGTGTG CTGGGGCTGT TGCGCGCCGT GCAATCCGAG
CCGCAGCGTG AGGCGCTCGA CCGGCTGCTG GTGCTCGGCA CCCTGCTCGA AGGTCACGCC
GAGCACGTGA TGGACGCCGT CGGGCCTGCG GTGGTGCCGT CGGTGGCCTC CATCAGGCAC
CGGTTCGATC AGCGCAGGCA ACGCAGACAG CCGCCGCTGC AACGGCTGTT GCGTGCGCTG
CTCGGCGTCG ACGCGAAGAT GAGCCAGTAC ACCAGGGGCA AGGCCTTCGT CGACCACGTG
GTGGCCGAGG TCGGCATGCA GCGTTTCAAC GCGATCTGGA CCGACGCCGA GACCCTGCCG
AAGCCCGCGG AAATCGACGA ACCGCAGCGA TGGATCGACC GGGTGCTGTA G
 
Protein sequence
MSSSSKSGFT VGRAVDWKLA ATLGGKLARP EPPATDYTRK QAFEQLAEAA RASELPVREV 
TGLIEGGEIP EARIVNRPEW IHAAAQSMRA MTGGGHADDV KPRAVTGRIA GAQTGAVLAF
VSSGILGQYD PFAVGGGELL LVYPNVIAVE RQLRVAPKDF RMWVCLHEVT HRVQFRANPW
LADHMSKALA VLTEDAGEDL PQVVGRLVDY VRDREVVVKN SEPAMNSTGV LGLLRAVQSE
PQREALDRLL VLGTLLEGHA EHVMDAVGPA VVPSVASIRH RFDQRRQRRQ PPLQRLLRAL
LGVDAKMSQY TRGKAFVDHV VAEVGMQRFN AIWTDAETLP KPAEIDEPQR WIDRVL