Gene Mflv_0923 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMflv_0923 
Symbol 
ID4972251 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium gilvum PYR-GCK 
KingdomBacteria 
Replicon accessionNC_009338 
Strand
Start bp965945 
End bp969070 
Gene Length3126 bp 
Protein Length1041 aa 
Translation table11 
GC content60% 
IMG OID640455121 
Producthypothetical protein 
Protein accessionYP_001132195 
Protein GI145221517 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGGTAA ACGACGAGTT GCTCGCCATT CCTGTTGACG ACCCGATCCG CCAGCGCGCA 
AGCAATATTG TCGAACCGCC TGTCGATCCG CAGCTCGAGT TGCTTCCTAC AAACTTGATG
GACTGGGAGG ACTTCGAGCG GCTGCTGTTA GATCTCGGAC GACACGAACT GGGGCTACGC
TCGCTGAGCT ACTTCGGGAA GCGAGGGCAA GCTCAGAAGG GACTCGACGT TGTCGGCACG
AATGCCCGCG GCAAGAGCGA AGGCATCCAG TCGAAGCGCT ACCAGAAGTT CGCCGTCGCG
AACTTGGACT CCGCGGTGGA GAAGTACACG CAATCCACGG TGCCGTTCAC TCTTGTCCGA
CTCGTCGTTG GTGTGAGCGC CAAAGTTGAC GATCGTGCCG TGGTGGAGCG CAAAGCCGTT
CTTAATGAGC AATATAATCC GTTAGACATC GATATCTGGG ACCAGCCACG GATTTCGGAG
ATGCTGCGCG ACGAGCCCGA GATCGTGATC AAGTACTTCG GGCCTCGGGC TGCTGAACGC
TTCTGTGTTC CGCATGTGCT CGTCCCTGTC GAGATTCCAA GTCCTGACGC GGTAGCCACC
GCAGACGCAG TTCTGCTCGG CCCGCTTATA AGTACCGATG CTCAAAGACT TGTCAATAGA
GCCAGTGAAA TCGCCGACGA CGAGCCCGAC GCGGCGCTTG CGCTCTACCA GGAAGTGCGG
AGCCGACTAT CAGGGTCAGG GTTTCCCGGC CACGCAGCCG AATTCGACGA CACAGTGGTA
GCGCTGTCTA TTCGCACTGA CCAAGCCGAG ACCGCAATTC GCCCATTGAT GGACGCGCTG
TGGGCGGCCG AGGGCAAAGG GGACTCACTA GGTGTCGACC GAGTTGGTCG GAGGTTGCGA
GACCTGGCCG ACCTCCCGGA GTTCGGACCT ACCCACAACA AAGTGCCGCG AACCCCAAGG
CTCGGCGCCG CGTTCGAGAT CGCAGACTTC GTTTCAGATC AGATGCATAC GCCGATCCCG
ACGCGCATCG AGATCCCATC CGCCGCAATT GCGCTCGCCG ATCGCGTCGA CCGCGCCCGC
GCGATCGTCT TTACAGCGGA GCGTGCCCTA GGCAACGATG ATCTGACTTG GATCGTCGAG
CACCGAGAGC AGATCGAGTC GACCGCCACT GAGGTAGACG TCAGCGACAT AGAGCTGGCG
GTCAGGCTGC GCTTAGTGAT AGCTGATGCC ACAGGAGAAT GGGCTGACTT GGTCCACACT
GCTAGAACAC GAATGCGTCG TGACCTAAAA GCGTTGACCA TCGCGCGATT TGCCCGCTAC
AAACTTCTTC AGGCGGCTCC GGCGGACGCG GACAGTGAGT GGCGCGATGC TATCGGCGAG
GCGTGCTTGG CGCAGCGACA CGCCGACGCA GCGGACTGGC TCTATAGTCA GCGTTTCGTC
GCGACCCGCT ACCGCGGAAT TGCGAAAGAC ACATGGCATC CCCTTGCACA AGCGCTATCC
GACTTGCCAT CGCGGCCGAA AATCGTCCCC ACCGCGAACG ACGCGCGGGA GCGCGCATTT
GCAGCAATGC ACTACGACGA ACCGAGGGTC GCTGCGATCA ATCTCCGGCG GCACCTCCTC
GATGGGATCC GATCGGCATC TTTCCATGAT GAACGTGAGG CCCGCCGGCT GCTCGGCGAG
ATTTACCGCA CCACAGATGA TCTCGTGCTG GCGGCCTACT ACTCTATCGG GTCAGGTGAT
CCGAAAGAAG CACGCGCAAT TGCCGCAGCG TTCGGCGACG TTTACCACGA TGTCACCGAA
TGGATGACCA GTCCGCTGTC GTGGGTGGCA TCATCCGCCC TGCAATTCGC CACCGAACAA
GCCGACCTCA TACCCGACAA AGACCTCGAT GCCGTTGTTG AGCTAGCGCT AAGCGCAATC
GACGATGTCG CGACAGGCAC GCGCCTCGAC TCTCCGATAC TGAGTCCGCA GATCTATCTG
TCGGCATACG GCCTACTCGC CGCGCTAGCC GAGCGTCTCT CAGAACGTCA TGCCCGCACG
CTGCTCGACA TGCTCGATGA CGCTGTCGTC GTGAAGGAAC ATCACTACCG ACGCACAGAT
GAAAGCCATG TCGAAATCGC GGCAGGCATC GCCCGCGCCC ATAACGGCGA GCTCCGGGAT
ACTGCGGTTG AACAGCTTGT CGGATTATAT GCTCGGGGGG CGCACCCATT TCGGGCTTCT
GCCCGCAATA CGCTGCTACG CAACCTCGAC CAAGTCCGTG ATCGGCTGCA AGAGATGGCC
ACGAACGACC ACCATGAGGC GGGCGCCCTG CTCGGCTATA GCGACCCCGG CCATGTCTCG
CCGGAAGCCG CCCGAGCCGC AGCGCAGCGC TTGAGCACGC CAACAACGAA CGGACCCAGC
GGTTTTGGAA CCGGGACGAG TGCCGTCAAC GACTCACTGC TCGCAGCCGT ACTGCCGGTC
GAGCAGCGTA TCCCGTGCAT CGAGATGCTC ATAGCGAACG CGTCATCGCC CTGGGAACCA
TCCTCGAACC GCGACAGCTA CCTCATCGCA GCAAGCAACT TAATCGATCA TCTGGATGAA
GAGCATCGCG CTCAATTCTT TGACACCGCA ATAAACTATG CGGCTAGTCC GCCTCCATCC
CAGGCAGATG CGTTCAACGC GTCAATGAGC AATCCGTTAG GCGCAATGCG AATTAACGAC
CGAAGTGACA GCCGGCCGGC AGCAGCCTTT CTCGCAGCTA GGCTTACAGC ATCAACTTTG
GAGAAGCGAG TGGTGCGCGA CGCAGCGCTG CGCCTCATCG GCGTCGGTTC GGACGACGAC
TATCGAGTCA CGACGGCACT TCAACTCGTT CAATCTGAAC TTGGTGACAG CATCGGCCTG
CTTGCGCAGG GTAGCTGGAC ACTTCGAAGT CTCGCCGCCA TTTTATGGGC CGAATCGACC
GACTTGCCAG ACGACCTCGG GCTTGCGCTG AGCCAGGACC GTGACGTCCG GGTACGACGT
GCGCTGGCAA CCGCCTTGGC GAAGGCTGAG CATCGACACG ACGCAAACGC CCGAGACATC
CTGCTGCGCG ATCCGCGGTG GTCTGTCCGT TCAATTTTGA GTTCAACTGC GCAGCTTCCC
GACTGA
 
Protein sequence
MQVNDELLAI PVDDPIRQRA SNIVEPPVDP QLELLPTNLM DWEDFERLLL DLGRHELGLR 
SLSYFGKRGQ AQKGLDVVGT NARGKSEGIQ SKRYQKFAVA NLDSAVEKYT QSTVPFTLVR
LVVGVSAKVD DRAVVERKAV LNEQYNPLDI DIWDQPRISE MLRDEPEIVI KYFGPRAAER
FCVPHVLVPV EIPSPDAVAT ADAVLLGPLI STDAQRLVNR ASEIADDEPD AALALYQEVR
SRLSGSGFPG HAAEFDDTVV ALSIRTDQAE TAIRPLMDAL WAAEGKGDSL GVDRVGRRLR
DLADLPEFGP THNKVPRTPR LGAAFEIADF VSDQMHTPIP TRIEIPSAAI ALADRVDRAR
AIVFTAERAL GNDDLTWIVE HREQIESTAT EVDVSDIELA VRLRLVIADA TGEWADLVHT
ARTRMRRDLK ALTIARFARY KLLQAAPADA DSEWRDAIGE ACLAQRHADA ADWLYSQRFV
ATRYRGIAKD TWHPLAQALS DLPSRPKIVP TANDARERAF AAMHYDEPRV AAINLRRHLL
DGIRSASFHD EREARRLLGE IYRTTDDLVL AAYYSIGSGD PKEARAIAAA FGDVYHDVTE
WMTSPLSWVA SSALQFATEQ ADLIPDKDLD AVVELALSAI DDVATGTRLD SPILSPQIYL
SAYGLLAALA ERLSERHART LLDMLDDAVV VKEHHYRRTD ESHVEIAAGI ARAHNGELRD
TAVEQLVGLY ARGAHPFRAS ARNTLLRNLD QVRDRLQEMA TNDHHEAGAL LGYSDPGHVS
PEAARAAAQR LSTPTTNGPS GFGTGTSAVN DSLLAAVLPV EQRIPCIEML IANASSPWEP
SSNRDSYLIA ASNLIDHLDE EHRAQFFDTA INYAASPPPS QADAFNASMS NPLGAMRIND
RSDSRPAAAF LAARLTASTL EKRVVRDAAL RLIGVGSDDD YRVTTALQLV QSELGDSIGL
LAQGSWTLRS LAAILWAEST DLPDDLGLAL SQDRDVRVRR ALATALAKAE HRHDANARDI
LLRDPRWSVR SILSSTAQLP D