Gene Mvan_5421 details


Gene Information

Locus tag: Mvan_5421
Symbol:
ID: 4646614
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium vanbaalenii PYR-1
Kingdom: Bacteria
Replicon accession: NC_008726
Strand:
Start bp: 5800507
End bp: 5801703
Gene Length: 1197 bp
Protein Length: 398 aa
Translation table: 11
GC content: 69%
IMG OID: 639808896
Product: type II secretion system protein E
Protein accession: YP_956197
Protein GI: 120406368
COG category: [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG4962] Flp pilus assembly protein, ATPase CpaF
TIGRFAM ID:
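
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes a single stop codon; the function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 5800507
END_BP = 5801703
GENE_LENGTH_BP = 1197
PROTEIN_LENGTH_AA = 398


def span_length(start: int, end: int) -> int:
    """Length of a closed interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count minus one, assuming the CDS ends in a single stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))            # compare with Gene Length
    print(expected_protein_length(GENE_LENGTH_BP))  # compare with Protein Length
```

Under these assumptions, both computed values agree with the Gene Length and Protein Length fields of the record.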


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value: 0.4117
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGCGCGC CGCTGATCGA TCGGGTTCGG GAACGACTCG CGATGCAGTC CTCACCGCTG 
CGACCGAGCG TGGTGGCCGC CGCCATCCGG GCCGAGTCCG GCGGCGTGCT GGGAGACGCA
GAAGTGCTGA CCAACCTGCG CGAACTACAG ACCGAGTTGA TCGGAGCGGG CATTCTCGAT
CCGCTGTTGT CGGGCCCCGG CGTCACGGAT GTGCTTGTCA CGGCACCCGA CGCGGTCTGG
GTCGACGACG GCAACGGCTT GCGGCGGACC TCCGTGCGCT TCGCTGACGA CGCCGCCGTC
CGGCGCCTCG CGCAGCGCTT GGCGTTGACC GCGGGACGCC GCCTCGACGA GGCTCAACCG
TGGGTCGACG GGAATCTCAG CGGCTTCGGC GCCGGTCAGC CCGGCGGACC GCTCAGCGTC
CGACTGCACG CCGTGCTGCC GCCGGTCGCC GCAGGCGGGA CATGCCTGTC GCTGCGGGTG
CTGCGGCCGG CCAACCAGGA CCTCGACGCA CTGATCGGCG CGGGCGCGAT CGAGCCGGAC
GCCGCGGGCA TCCTGCGCGA CGTGATCTCC GCCCGGCTGG CCTTCGTCGT GTCCGGTGGG
ACGGGCGCCG GCAAGACCAC CTTGCTGGCG GCGTTGCTCG CCGCGGTGCC CGAGGACGAG
CGCATCGTCT GTGTGGAGGA CGCCGCCGAA CTCGCGCCGG ATCATCCGCA TCTGGTCAAG
CTCGTCGCAC GCTGCGCGAA CGTCGAAGGC GCGGGCGAAG TGACCGTGCG TGACCTCGTC
CGGCAGGCGC TGCGGATGCG ACCCGACCGC ATCGTGATCG GTGAGGTGCG CGGCGCCGAG
GTGGTGGACC TGCTCACCGC GCTGAACACC GGTCATGACG GTGGCGCGGG GACCGTGCAT
GCGAACAACC CCGCCGAGGT GCCGGCACGT TTCGAGGCGC TCGCCGCGCT CGGCGGCCTC
GACCGTGCAG CGCTGCACAG CCAACTCGCG GCAGCGATCT TATCTGTAAC GGGTTGTGGG
GTGTGTAGGA CTGTCACCCA CGTGGGTTTT AGATGGGGTT TTCTCGTAGG GGACACTGTT
ATCTGTAACG GTGTCACCGA CGTTGGCTCG CGGTGGGTGT TGCTGTGTTG CGGAGGCGGT
GCAGTTCGCC GTGTGCGGTG GCGAGGGCTT GTTCGAGGGT TTTGTTCTCG GCTTTGA
 
Protein sequence
MSAPLIDRVR ERLAMQSSPL RPSVVAAAIR AESGGVLGDA EVLTNLRELQ TELIGAGILD 
PLLSGPGVTD VLVTAPDAVW VDDGNGLRRT SVRFADDAAV RRLAQRLALT AGRRLDEAQP
WVDGNLSGFG AGQPGGPLSV RLHAVLPPVA AGGTCLSLRV LRPANQDLDA LIGAGAIEPD
AAGILRDVIS ARLAFVVSGG TGAGKTTLLA ALLAAVPEDE RIVCVEDAAE LAPDHPHLVK
LVARCANVEG AGEVTVRDLV RQALRMRPDR IVIGEVRGAE VVDLLTALNT GHDGGAGTVH
ANNPAEVPAR FEALAALGGL DRAALHSQLA AAILSVTGCG VCRTVTHVGF RWGFLVGDTV
ICNGVTDVGS RWVLLCCGGG AVRRVRWRGL VRGFCSRL
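
The sequences above are listed in blocks of ten characters, so they need to be joined before any computation. The following is a minimal Python sketch of that cleanup, together with a simple GC-fraction calculation. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the gene sequence is used as sample input.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide string."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence listing, used only as sample input.
first_line = "GTGAGCGCGC CGCTGATCGA TCGGGTTCGG GAACGACTCG CGATGCAGTC CTCACCGCTG"
seq = clean_sequence(first_line)

if __name__ == "__main__":
    print(len(seq))          # number of bases on the first line
    print(seq[:3])           # first codon of the listing
    print(gc_fraction(seq))  # GC fraction of this line only, not of the whole gene
```

Running the same functions on the full gene sequence would give a value to compare against the GC content field of the record.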