Gene Mvan_5366 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_5366 
Symbol 
ID4647082 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp5742278 
End bp5743468 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content75% 
IMG OID639808841 
Producthypothetical protein 
Protein accessionYP_956143 
Protein GI120406314 
COG category[A] RNA processing and modification 
COG ID[COG5178] U5 snRNP spliceosome subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.000423531 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGACCGATC CGACCCGTGG CGCGCGCCTT CGGCGCGGCG GCCGCAGGCC GGGTTGGATC 
CTGATGACGG TGTTGCTGGT GCTCGCGATC GCAGCCAGTT CAGCGCTGGT TTTCACCAAC
CGCGTCGAAC TGCTCAAGCT GGCCGTGATT CTCGCGTTGT GGGCGGCCGT GGTCGCCGCG
TTCGTCTCGG TCATCTACCG CAGGCAGAGC GACGCCGACC AGGCCAAGGT GCGCGACCTC
AAGCTCGTGT ACGACCTGCA GCTCGATCGC GAGATCTCCG CGCGACGGGA GTACGAGCTC
GCCGTCGAAA CGCATCTGCG CCGCGAGTTG GCCTCGGAGT TGCGGGCGCA GTCCGCCGAC
GAGGTGGCGG CGCTGCGCGC CGAACTGGCC GCGTTGCGTG CGAATCTGGA ATTCCTCTTC
GACACCGATC TCTCGCACCG GCCCGCCATC GAGACCGAGC GCACCGCCGG GCGCGTCAGC
AGCAGCCGGA TCGACACCCA GGAAGACTTC AGGGCCGCCG AGGAGCCGTA CGCACCCAAG
ACCGATGAGA GTCCCATCAT CGACGTGCCG GCCGAGCCGC ACCCTCCGGA GGGCGAGTGG
GCACCGCGCG GCGAAGCCGG TGGCGCGCAT CGCCGTTCGG CCGAGCAGCC GCAGTGGGCC
CCGCCGCCCG CGCCCGCGCC CCCGCCCCCG CCCCCGCCCC CGCCCCCGCC GCCTCCACCG
CCCCCGGCGC AGCAGCCGCC ACCTCCGCCG CAGCCGACCC CGCCCCCGGC GCAGCAGCCG
AGCCCCGAGC CGCAGTTCCC CTGGCTGCCG CCCGCTCCGC CGCCACGCCC GCAACCCCGC
ACCCCCGAAC CGGCACCCAC TGCTTCCGGG TGGAAGCCGG TGCCCGCTGA GGGGCAGTGG
ATTCCGGCGG GAGAGCCCGG CAGCCACTGG GCCGCCGCGC ACGCCAACGG CGACCAGGGC
GAGTATGTGG GCCGCCGCCG GGCGCCGGAC CAGGTCGAGC CCGAGCCACC CCGCGGCAAG
CATTCCGCGG CGGGTGAGGA GCCGACGGAG GCACCTGCTG CACCGGAGGC GCCGGCCGAG
CCCGACGCCG ACGGCGGCGC GCACACCGGT GGCCAGTCGG TGGCCGAGCT GCTGGCTCGG
CTGCAGGCGG CCCCGTCGGG CGGTGGCAGG CGTAGGCGCC GCGAGGACTG A
 
Protein sequence
MTDPTRGARL RRGGRRPGWI LMTVLLVLAI AASSALVFTN RVELLKLAVI LALWAAVVAA 
FVSVIYRRQS DADQAKVRDL KLVYDLQLDR EISARREYEL AVETHLRREL ASELRAQSAD
EVAALRAELA ALRANLEFLF DTDLSHRPAI ETERTAGRVS SSRIDTQEDF RAAEEPYAPK
TDESPIIDVP AEPHPPEGEW APRGEAGGAH RRSAEQPQWA PPPAPAPPPP PPPPPPPPPP
PPAQQPPPPP QPTPPPAQQP SPEPQFPWLP PAPPPRPQPR TPEPAPTASG WKPVPAEGQW
IPAGEPGSHW AAAHANGDQG EYVGRRRAPD QVEPEPPRGK HSAAGEEPTE APAAPEAPAE
PDADGGAHTG GQSVAELLAR LQAAPSGGGR RRRRED