Gene Mflv_5089 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMflv_5089 
Symbol 
ID4976400 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium gilvum PYR-GCK 
KingdomBacteria 
Replicon accessionNC_009338 
Strand
Start bp5403341 
End bp5405029 
Gene Length1689 bp 
Protein Length562 aa 
Translation table11 
GC content64% 
IMG OID640459316 
Productglycosyl transferase family protein 
Protein accessionYP_001136343 
Protein GI145225665 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0463] Glycosyltransferases involved in cell wall biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.174654 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.203443 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCCGACGG ACAGGCCCAC GATCTGTCTG AACATGATTG TGCGCAACGA AGCCCACGTC 
GTGCACGAGG TTCTCGACTC TGTAGCACCG TTCATCGACG CGTGGGTGAT CGTCGATACC
GGTTCCTCCG ACGGAACACA GGACACGATT CGCGCCCACA TGGCGACGCT GGGGATTCCC
GGGGAACTGC ACGAGAGACC GTGGCGGAAT TTCGGCCACA ATCGGTCCGA GGCGCTCGAC
CTGGCACGGG GTCACGCCGA CTACATCTGG GTGATGGATG CCGATGACCT TCTGGTCGGC
ACGCCGGATT TCAGCGGCCT CACCGCCGAT GTGTACCAGC TGCACTACGG GCCCGATGTC
TCGTACTGGC GGAGGCAGTT GTTCAGAGAC GGCCTTGGCT GGCGATATGT CGGCGTGCTA
CACGAGTACG CCGAGTGTGA CGGCCCGTGT GCCGAGGAAC GCCTCCACGG TGACTACCAC
ATCGAGTCGC GGCGCCTCGG CGGACGTAAT CTCGATCCGG AGAAGTACGC GCGCGATGCC
GAGGTCCTGC TGGCGGAGGT CGAGCGCAAC CCGGAAGATC CGCGATCGGT GTTCTATCTG
GCTCAGAGCT ATTACGACCA TGGTGACTAC GCCAGCGCGA ACCGGTGGTA TCGGCGGCGC
AGCGAAATGG GTGGTTTCGA CGAAGAGGTG TACTACTCGC TGACGCGTGT CGCAGATACG
ATGTCGCGGC TCGGCGAACC CTGGCCACTG GTTCAGGATG CGTACCTGCG CGCCTGGGAA
TACCGGCCCT GGCGGGCAGA GGCGCTCTAC GCCATCGCGC GGCAGTATCG CGATGATCAG
CGCTACCAGC TCGGCCACCT GTTCGCCGAA CGCGCCGCCC GGATACCGCT TCCCGAGGGC
GACGTGTTGT TCGTCGGCGC GGAGGTGTAC ACGTGGCGCG CGCTGGACGA GCAGGCGGTC
TGCGCGTCGT GGATCGGTAA ACAGAACGAG ACCTTCGAGA TCTGCCGACA GATCTTGCGC
CGCGACGACG TACCCGACGA CGACCGGCAA CGCGTCGCGG CGAATCGTGA CTGCGGGGTG
CCGTCGTTGT TGGAGGCCAC CGCGGTCTAC CCGGACACCT TGCCGCACAG TCTTACCCGT
GGAGGTGACG TCACGGTCAC CCTGGTCGCC GGTCCGGACC GGATGGCGAG CGAGCGCACG
CTCAATTCGT TGTTGCGCTG CTGCTCCGAC ATCACACGGG TCGCCCGGCT GATGGTCATC
GACATCGGGT TGTGGCCGGA GGATCGTGCC GCACTGGCCG ACGCCTACCC TTTTCTAGAG
TTTCGCCAAT CCCGCCCTGC AGTGCGGCGA GAACAGATTC GCAGTGAGAT CGGTACGAAA
TACTGGATGG ACCTCGGAAT GGGGTGGCAA TTCTTCGCCC AGGAGGACTA CATTCGGCGC
CTGACCTCCG CGTTGGAGGC CGAGTCGAGC GTTTGCCAGG TCGGTGTCAA CTATGGCGAC
GCCGACAAGT TGACGGGGTG CGTCGCGCCA CTGACCACGA TTCGCGGCAA CGCGGACACC
GGCCGCTATG TGCTGACCGA CACCGCCGCC GACGGCCCGG CCATGTTCGA CTGCACCCGA
TGGAACCACA CGGAAAACGC CCTCCGCACC GCCTCACTCG ACGAGGTGCT GTGCGTGCTG
CAAAGGTAA
 
Protein sequence
MPTDRPTICL NMIVRNEAHV VHEVLDSVAP FIDAWVIVDT GSSDGTQDTI RAHMATLGIP 
GELHERPWRN FGHNRSEALD LARGHADYIW VMDADDLLVG TPDFSGLTAD VYQLHYGPDV
SYWRRQLFRD GLGWRYVGVL HEYAECDGPC AEERLHGDYH IESRRLGGRN LDPEKYARDA
EVLLAEVERN PEDPRSVFYL AQSYYDHGDY ASANRWYRRR SEMGGFDEEV YYSLTRVADT
MSRLGEPWPL VQDAYLRAWE YRPWRAEALY AIARQYRDDQ RYQLGHLFAE RAARIPLPEG
DVLFVGAEVY TWRALDEQAV CASWIGKQNE TFEICRQILR RDDVPDDDRQ RVAANRDCGV
PSLLEATAVY PDTLPHSLTR GGDVTVTLVA GPDRMASERT LNSLLRCCSD ITRVARLMVI
DIGLWPEDRA ALADAYPFLE FRQSRPAVRR EQIRSEIGTK YWMDLGMGWQ FFAQEDYIRR
LTSALEAESS VCQVGVNYGD ADKLTGCVAP LTTIRGNADT GRYVLTDTAA DGPAMFDCTR
WNHTENALRT ASLDEVLCVL QR