Gene Mflv_4744 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMflv_4744 
Symbol 
ID4976056 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium gilvum PYR-GCK 
KingdomBacteria 
Replicon accessionNC_009338 
Strand
Start bp5052632 
End bp5054632 
Gene Length2001 bp 
Protein Length666 aa 
Translation table11 
GC content63% 
IMG OID640458973 
Productputative outer membrane adhesin like protein 
Protein accessionYP_001136000 
Protein GI145225322 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0739] Membrane proteins related to metalloendopeptidases 
TIGRFAM ID[TIGR01965] VCBS repeat 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.511214 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTTTACG GGAATTTTGT CGGTCGGGTC GGCGCGCTCG CCGCTGCACT CGGCATCGGA 
ATCGCGATCG CGTCATTGCC GGTGTCAGCA TCGGCCGACA CGGGTGCCAG TACGTCGTCC
GAGAGTGAAT CCTCGGCCAG CCGAGAGATA TCACGAGGGG CGTCACGGGA AGCTCGGCCC
GAACACGTGA CACCGAACTC CGACGACGCC TCCTCGACTA CCGACTCTGC CGCTGATGCG
CCACACACAG TGTCGCGTCG AACCGCGACC GGACGCGCGA ATGGTGATGC GGGCGACCGC
GATCTCGACG ACGTCGTGGC GGCCTCCGAG GAAGAGGCGC CGGAGCCGAA AGCCATCGAG
GCAGGCTCGT CGACCCCCTC GGACAGCGAC GAGATCGCAA CCATCGACTT GGCCGCGAAG
TCACCCACTC CCAGCGAGTC ACGTACTCGA ACAGGTGACA TTCTTGGGAC GGCCACAACG
CCGTCGTCGA CTCCGAATCC CCCGGCGCCG GCAGACTCGG CAATCGCCCT CGCCGTACTT
GCGTCGGTAC GACGGCCCGT TTCCGCTGCG GAAACGCGCG AGCATGCGCC CGCAGGTAAT
ACGTCAGTCG TGTACGCCGT GCCCAATACT CAGCCAACCA TTTCCTCGAT ATCAGCAAGG
GCCCCGGGCC TGTTCACCGG CAACGTCACG GGCCGGGTTC GCGCCACCGA CGCAGATCGC
GACAAGCTCA GCTACACCGC GACCGCATCT CAGGGCACCG TGAAAATTAA CTCAATGGGG
TCCTATACCT ACACCCCGAG CGTTGCTGCG CGTCATGCTG CAGCAAGGGT CGGAGCACAA
GATCAGGACA CCGTCACCGT GACCGTCACC GACGGCCACG GCGGTGTTGC CACAACCTAC
ATCGGCGTGA TGATCCGCCC GAAGAACACC AACCCCACCG CGAAAGCAAC TGTGGGCAAA
GGTAATCCGG CCAATGGCAT CGTCACCGGC CAGATCGTTG GTAGCGACAA GGACGGTGAT
TCAGTGACGT ACTCTGTCGC CGAAACCACC GCCCGGGGCA GTGTGGTCAT GACAGCTGAC
GGCACCTTCG TGTACACCCC CACTGCCGGC GCTCGGGAAG CAGCAACCAG CTTCTTTCGT
CGCAGCGACC GCTTCACCGT CGCGGTCGAC GACGGCCACG GTGGTATCAA GAAGTTGAGT
GTGCGGGTCG GCATTGTTCC GCCAGGTAAG AACTCGGCGC CGACTGTGGG GAATCCGAGC
TACACCATCA CCAATGTGTC CAGCGCAGAT GGCACTGTCA CCGGCTACGT CAGCATGGCC
GATCCCGACG GTTTCGGGCT CACTTTCTCG GTCGCGGGTG GTATCGACCC GACCGTCGGC
GCTCTGGCTG TCAATGCCGC CACTGGCAGC TTCAGCTTCA CTCCGACGAC ACGAGCCCGT
GAGATCGCGC ATGGCACGGC GGGGGAAGAT TTCGTGCGCT TCGCGATCGC GGGGAGCGAC
GGACTGGATA GAACGACGGT GGAGGTTTCG GCCGCCATCA GCCCCAAAGC TCCGCCGCCG
CCGCCTCCCC CGCCGCCACC TCCGACCACC CGCATGCGGT GGCCGCTCGG CTCGGTTCAG
GTCAACCGCT ACTTCGGCGG CAATGGACAC AATGGGATCG ATTTGAACGC TTCCATCAAT
CCGAAGACAC CGGTGTATGC GGCCGCTGAC GGCGTCATCT CATTCGAGGG TTACGGACAG
AATCACTCCT GGATGACATG GCAGGCGGGC ATCAGCGTCC TGGTCTGGCA CCCGGCTTTG
AACGTGTATT CGGGGTACGC GCACCTGAGT AGCACGGTCA TCAACAACGG CCAGACGGTC
GCACGCGGTC AGCTCATCGG TTACGCAGGG TCGACCGGAA ATTCATCCGG GCCGCACCTG
CATTTCGAGG TGCTACCCCG CACGCCCAAC TTCAGCAACG GGTATTCCGG ACGGATTGAT
CCCCTGCCCT ACCTCCGATG A
 
Protein sequence
MVYGNFVGRV GALAAALGIG IAIASLPVSA SADTGASTSS ESESSASREI SRGASREARP 
EHVTPNSDDA SSTTDSAADA PHTVSRRTAT GRANGDAGDR DLDDVVAASE EEAPEPKAIE
AGSSTPSDSD EIATIDLAAK SPTPSESRTR TGDILGTATT PSSTPNPPAP ADSAIALAVL
ASVRRPVSAA ETREHAPAGN TSVVYAVPNT QPTISSISAR APGLFTGNVT GRVRATDADR
DKLSYTATAS QGTVKINSMG SYTYTPSVAA RHAAARVGAQ DQDTVTVTVT DGHGGVATTY
IGVMIRPKNT NPTAKATVGK GNPANGIVTG QIVGSDKDGD SVTYSVAETT ARGSVVMTAD
GTFVYTPTAG AREAATSFFR RSDRFTVAVD DGHGGIKKLS VRVGIVPPGK NSAPTVGNPS
YTITNVSSAD GTVTGYVSMA DPDGFGLTFS VAGGIDPTVG ALAVNAATGS FSFTPTTRAR
EIAHGTAGED FVRFAIAGSD GLDRTTVEVS AAISPKAPPP PPPPPPPPTT RMRWPLGSVQ
VNRYFGGNGH NGIDLNASIN PKTPVYAAAD GVISFEGYGQ NHSWMTWQAG ISVLVWHPAL
NVYSGYAHLS STVINNGQTV ARGQLIGYAG STGNSSGPHL HFEVLPRTPN FSNGYSGRID
PLPYLR