Gene Mflv_5159 details

Gene Information

Locus tag: Mflv_5159
Symbol:
ID: 4976470
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium gilvum PYR-GCK
Kingdom: Bacteria
Replicon accession: NC_009338
Strand:
Start bp: 5493807
End bp: 5497187
Gene Length: 3381 bp
Protein Length: 1126 aa
Translation table: 11
GC content: 68%
IMG OID: 640459389
Product: hypothetical protein
Protein accession: YP_001136413
Protein GI: 145225735
COG category: [S] Function unknown
COG ID: [COG4913] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
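
The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the CDS ends in a single untranslated stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields; names are illustrative only.
start_bp = 5493807
end_bp = 5497187

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 3381 1126
```

Both results agree with the Gene Length (3381 bp) and Protein Length (1126 aa) fields.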


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.236742
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.119537
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGACTGAGC AATTCCACCT GTCCCGGCTG CAGGTCATCA ACTGGGGTGT CTTCGACGGC 
TATCACGACA TCCCGTTCAG TGAAGGCGGC GCGCTGATCG CCGGCGCCTC CGGTAGCGGC
AAATCGTCTC TGCTGGATGC GATCTCGCTG GGCTTCCTGC CGTTCAACCG GCGCAACTTC
AACGCCTCGG GCGACAACAC CGCCGCGGGG TCCAGCGCGG GCCGGCGCAC CGTCGACAAG
TACGTGCGCG GTGCGTGGGG CCAGCGCAGC GACGGCGGCA CGAGCCGGGT GATGTACCTG
CGCGGTGACG GAACCGCCTG GTCGGCAGTG GCGGTCACCT ACTCCAGCGA CTCCGGGCGC
ACCGTGACGG GCCTGGTGCT GAAGTGGCTG ACGGGCGAAT CGCGCAACGA CTCGTCGAGC
CGCTTCGTGC TCGGCGACGG TGACCTCGAC ATCGAGGACA TCTGCAACCG TTGGGCGGCA
GGACGTTTCG ACACCGGTGT GTTCAAGGAA GGCCAGGGCG AGCGAAGCGA CGGTTGGCGG
TTCACCACCA AGGTCGAGTC GCAGTATCTG GCCCAGCTGT ACGCGACCAT CGGGATCCGC
GCATCGGATG CCGCCCAACA GCTGCTCGGC AAGGCGAAGT CACTGAAAAG TGTTGGCGGG
CTGGAACAGT TCGTCCGGGA GTTCATGCTC GACGAGCCCG AGAGCCTGAC CCGGCTGCCG
GAGGCACTCA AGCAGATCGA CCCGCTGGTG GAGGCCCGCG AGTTGCTCGC GGTCGCGCAG
AAGAAGCGCA AGATCCTCGG CGACATCGAG AAGATCCAGC AGCGCTATGC CTCGGAGTCC
ACCGATCTGG GCATCATCGA CCTGGTGGAC CTGCCGATGG TGCGGGCCTA CACCGATCAC
GTCCGGCTGG CGCAGTGCCC GGCCCATGTC AGCCAGCTCG ACACCACCAT CGATCAGCTC
GACAACGAGC ACGAGGACAT CACGCGGAGC CTGAATCTGG CCAAGGCGGA AGCAGATTCA
CTCAACGCGC AGATCAGCGG CTCGAGTGCG AGCATCGGTC CGCTGCAGTC GCAGGTGACC
GCCGCCGAGA CCGAGGCCGA GCAGGTGTCG CGGCGCCGCA ACGCCTACGA GGACTTGCTC
GCCGCGCAGG GTCTCGACGC GCCGGAGACC GCCGACGCGT TCTGGAACCT GCGCGAGGAA
CTTCTCGCGC AGGCCACCGA TCTGCTCGCC AAGGTCGAAC GCAACCGGGA GGCGTCCACC
GACGCGGAGT ACGCGCAGAA GTCCGCCCGG CTCGTCCGCG ACGAGGCCGC CAAGGAACTC
AAACGCGTCG AGCACGTCGG CTCCGCGCTG CCCGAGTTCG CGCTGACGAT GCGCGATCAG
ATCTGCGCGG CGGTCGGGGT GGACGCCACC GACCTGCCGT ATGTCGCCGA GCTGATGGAC
CTCAAGGCCG ACCAGACCCG TTGGCGCACC GCGGTGGAGA AGGTGCTGCG CGGGGTCGGC
TTGCGGTTGA TGGTGCCCGA CCAGCACTGG ACGGAGGTGC TGCGCTTCGT CAATGAGACC
AATATGCGCG GACGGCTGCA GCTGCACCAT GTGCGCGCGA AGTTCCTCGG CGCCACCCCG
GTCGATCCGG AACCGAACAC GTTGGCGGGT AAGCTGTTCG CGGTCGATCC GGAGCACCCG
TGTGCGGCCG AGGCCGTCGA CGTCGTGACC GCCGCGGGTG ACCACATCTG TGTCGACACC
CCCGAGGTGT TCGCCCGGTT CCGCCGCGCG GTCACCGACA CCGGCCTGTA CAAGGATTCC
GACCGGCTGG CGATCAAGGA CGACCGGCGC CCGCTCAAGC AGTCCGAGTA TCTGTACCAG
GGCGACGTGT CGGCCAAGAT CAACGCGCTG ACCGTCGATC TCGCGGCGGC CGAGGAGGCT
TACCAGAAGG CGCGCCGCGT CGCCGACGAC ATCGCCGCGC AGCGCCAGCA GTGGCGTGAC
CGGGCCGCGG CGTGCAAGGC GATCTGCGAG CAGTACCCGC AGTGGAGCCA GATCGACACC
GAGACCGCCG ACGGGCACGC CGACCGGCTG CGCGAGCAGT ACGAGCTGCT GCTGGCCGAG
CATCCCGACA TCGAGGCGCT CAATGCGCGC GCCGACGAAT GCTGGTCGCA GATCCAGAAG
TTGATGACGC GACGCGGCGC GGTCCAGACC CGGCGCGACG CGCTCGACGA CCGCAGGACC
CGGCTGCTGG AACTCGCCGA ACGGCTCCAG CCCGCGTTCG TGTCGGAGCC GCTGACCGAG
CTGCTGCACC GGTACGCCGG ACAGCTTCCG GTCAGCCTGG AACTCCTTGA GCCCGAACCG
CATCGCGACG CGCTGTTCAA CGCGATCAAG AAGGAGCGCG AGCAGCTGCG CGAGAGCCGC
CGCCGCTCGT ATGACGAACT GGCCCGCATT CTCAATACCT TCGACACGTC GTTCCCCGAC
GCGATCCCGA ACGACTCCGA GAACTTCGAC GAGCGGGTCC ACGACTACGT CGCGCTGTGC
CGTCACATCG ACGAGCGCGA GCTGCCGGAG GCCTACGAGC GGATGATGCG GCTGGTCACC
GAGCAGGCAC CGGATGCGAT CCTGACGCTG CACCGGGTCG CCGAGCAGGA GGCGCGGCGG
ATCAGCGACC AGATCGACCG TGTCAACACG GGTTTGGGGG CGGTGGAGTT CAACCGCGGC
ACCCGCCTGA CGCTGCGCGC GACGCCGCGC AGCCTGACGG CGGTGTCGGA GCTGACCGAG
ATCGTGCGGG CGATCTCGCG GCGCATCGCC GAGGTCGGGC TCGGCGACAA GCAGGCGATC
CTGGACCAGT ACGCCGACAT CCTGCGGCTG CGCAACCGGC TGGCGTCGAC GTCGCCGGAG
GACAAGGCGT GGACGCGCGA CGCGCTCGAT GTGCGCAACC GGTTCACGTT CGACTGCGCC
GAATGGGATG TGGCGACCGA GGAGTTGATC CGCACGCACA GCAACGCCGG CGACAACTCC
GGCGGCGAGC AGGAGAAGCT GATGGCGTTC TGCCTCGCCG GTGCGCTCAG CTTCAACCTC
GCGGCCCCCG AGAGTTCGGA CAACAGACCG GTTTTCGCGC AGTTAATGCT CGACGAGGCG
TTCTCGAAGT CGGACCCGCA GTTCGCCCAG CAGGCGCTGC AGGCGTTCCG CAAGTTCGGG
TTCCAGCTGG TGATCGTCGC GACCGTGCAG AACGCGACGA CCATCCAGCC CTACATCGAC
AGCGTGGTGA TGGTGTCCAA GACCGAGGCG ACGGGACGCA ACGCCCGCCC CGTCGCGACG
GTGTCGACCC GCACCATCTC GGAGTTCACC GATCTGCGGC ACGCGATGCG CGCCGAGGCG
AAGGTCCCGG CGGGCGTCTG A
 
Protein sequence
MTEQFHLSRL QVINWGVFDG YHDIPFSEGG ALIAGASGSG KSSLLDAISL GFLPFNRRNF 
NASGDNTAAG SSAGRRTVDK YVRGAWGQRS DGGTSRVMYL RGDGTAWSAV AVTYSSDSGR
TVTGLVLKWL TGESRNDSSS RFVLGDGDLD IEDICNRWAA GRFDTGVFKE GQGERSDGWR
FTTKVESQYL AQLYATIGIR ASDAAQQLLG KAKSLKSVGG LEQFVREFML DEPESLTRLP
EALKQIDPLV EARELLAVAQ KKRKILGDIE KIQQRYASES TDLGIIDLVD LPMVRAYTDH
VRLAQCPAHV SQLDTTIDQL DNEHEDITRS LNLAKAEADS LNAQISGSSA SIGPLQSQVT
AAETEAEQVS RRRNAYEDLL AAQGLDAPET ADAFWNLREE LLAQATDLLA KVERNREAST
DAEYAQKSAR LVRDEAAKEL KRVEHVGSAL PEFALTMRDQ ICAAVGVDAT DLPYVAELMD
LKADQTRWRT AVEKVLRGVG LRLMVPDQHW TEVLRFVNET NMRGRLQLHH VRAKFLGATP
VDPEPNTLAG KLFAVDPEHP CAAEAVDVVT AAGDHICVDT PEVFARFRRA VTDTGLYKDS
DRLAIKDDRR PLKQSEYLYQ GDVSAKINAL TVDLAAAEEA YQKARRVADD IAAQRQQWRD
RAAACKAICE QYPQWSQIDT ETADGHADRL REQYELLLAE HPDIEALNAR ADECWSQIQK
LMTRRGAVQT RRDALDDRRT RLLELAERLQ PAFVSEPLTE LLHRYAGQLP VSLELLEPEP
HRDALFNAIK KEREQLRESR RRSYDELARI LNTFDTSFPD AIPNDSENFD ERVHDYVALC
RHIDERELPE AYERMMRLVT EQAPDAILTL HRVAEQEARR ISDQIDRVNT GLGAVEFNRG
TRLTLRATPR SLTAVSELTE IVRAISRRIA EVGLGDKQAI LDQYADILRL RNRLASTSPE
DKAWTRDALD VRNRFTFDCA EWDVATEELI RTHSNAGDNS GGEQEKLMAF CLAGALSFNL
AAPESSDNRP VFAQLMLDEA FSKSDPQFAQ QALQAFRKFG FQLVIVATVQ NATTIQPYID
SVVMVSKTEA TGRNARPVAT VSTRTISEFT DLRHAMRAEA KVPAGV
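
Both sequences above are printed in space-separated blocks of ten residues, so the spacing has to be removed before any downstream use. The following is a minimal sketch of that cleanup plus a simple GC fraction calculation; `clean_sequence` and `gc_fraction` are hypothetical helper names, not part of any database tool.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace from a blocked sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a cleaned DNA string (0.0 if empty)."""
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


# Last line of the gene sequence above, used as a small example.
tail = clean_sequence("AAGGTCCCGG CGGGCGTCTG A")
print(len(tail), tail[-3:])  # prints: 21 TGA
```

Applying `gc_fraction` to the full cleaned gene sequence would give a value that can be compared with the GC content field of the record.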