Gene Mflv_3803 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMflv_3803 
Symbol 
ID4975119 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium gilvum PYR-GCK 
KingdomBacteria 
Replicon accessionNC_009338 
Strand
Start bp4059988 
End bp4063533 
Gene Length3546 bp 
Protein Length1181 aa 
Translation table11 
GC content66% 
IMG OID640458027 
Producthypothetical protein 
Protein accessionYP_001135063 
Protein GI145224385 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGGCCGCCG CCAGTCCCCG AAAGTCGCGT CGGACCCCCG GCCGTCACCG CAAGCCCAGC 
AACCATGCCC ACTGGTTGAG GGTCGGTGCC GTCGGCTTCG GTCTGGCCGC AGCGATCATC
AGCGGTCAGG GCGTGGCGAC CGCGGATACC GACGACTCGT CGGGGTCGGG CGAGAGCAGC
GCGACGGCCA GGGGTAAACA GGCCGCCGTC GCCGATCGCC CGAGCGCGTC GGAGCAGAGC
GAATCGCCTT CGTCCCCCGA CAGCGACAAC GATGGCGTCG GCACCGACGA CGAGACCACG
GACGACACCG ACACCGACAC CGGCGCGGAG AGCCCGGACG TCGAGGTAGA CGAACAAGAT
CAGGATCCGC AGCCCGACAC CGTCGAGGAG CCCGACGAGG ACACGTCCGA GGTGACCGTC
GACGAATCCG ACGGTTCGCG CACGCACCAG AAACCGGTCG TCACCGAGAC CACGCCCCCT
GCCGAGGCAA GCCCGCCGTC GACACCCGCG GACGGGACCG ACGCGCCTCA ACCGGAAACG
GCGCCTCCCG CCACGGGCGG CAACAAAGCT GACCCGGGCA CCACGCCCGC GCCGAGTTCG
CCCGGCGGGT CGACGGGCGG CCCGTCGACC GCGGCACCTG ACGACGAAGC CGCCACACCG
GAGACTGAGG CCGAAACCGA AGCGGCGTAC CTGCTCAATT TCGTCAAAGA GTTGGTGAAG
GTGTTCGGAG GGGCCACGAC CGACGGTGGC GGCACCGCAC CGCCGATGTC GTGGGACAAA
GTGCTGCAGG GCCTCTTCCT CGGGCGCAAG GATCAAGAGA CGAGAAACGC CACCAGAGAG
TTGATTGAGC TCGCCAACGA GAAGACGCTC GCCGAGCTCG GCACGTCGAT CGGATGGGTT
CCCTTCGTCG GGACGGGCCT GTACGGGGCC AGCTTCGTGG GCAACCTGGG AGCGCTGATC
AGCGCTGTGC TGCGCGGCGA CAGCGCTGAC ATCGCTGACG AATTCCGTGA TCTCGGCCGC
GACGTGGTCG GCATGGTCCC GGTCGTCGGC GCACCCACGG CGGCGAAGCT GTACGCCGGC
GGATCGGACG CCGTCGGTGC GTCAGCCCAG ATGTCGGCCC AATTGTCGGC GCAGATGACA
GCGATGGCGA CAGCGACGCC ACCCCCTCCA GTGGGAACGC AGGCGCACTT CCTGTGGGTG
CTGCAGCGCG CCGTGAACCG GTTGGCGGGC TGGCCCGGAC CGGCGGGTCA GAACTTCGTC
GACATCACGA ACCGTGTGAC CGACCAGACC CTGGACAACG CCGACAATCA GCTCGACATC
CTTATCGCCA ATGCGTTCGC GGGCAGTCCG GCTGTGTGGC TTCCCGACCT GGGCAGGGTT
CTGGGGCTCT TCGTACTGAC GGCAATACCC GGGTACTCCT ACACCGACAG CCTGAACGCC
TGGGGCAGCT TCCTGAACAA GATCATGCCG CCTTACAAAA TCGCCGACGG GGCGGGCACT
CTCGACGTTA TCTCGAACTA CAAGATCACG GGTGCTGCCG TGGTCGGCGC AGCGACTCTT
CTGAGGGACA TGCTGAACGG GATCTACGAT CCGGTTCAGT GGGAGATCAA CATCATCAAG
ACGACCACCG GGGCGACCGT CACGGCGTCG GATCTCAATG ACTTCAACAC GATCATGACC
AAGGTCGCCG CCGCTCAGGC CGGGGCTATT TTGATCGGCG GTGACGGTGG TGCGTTCGAC
GATCCGACCA GAGCCTGGAA TGTCACGCTG CCCACCTGGA CCGAGGCTCA GGTTAATCCG
TACACGATCA CCACCTACGT AGCGCTCGTC GCCATCTACA AGCGCTTCCA GGAGATGGCC
ACGCTGACGA CGTTCACCAC ATGGACGACG TACGACAGTT GGCACTACAC CAACGCTTTG
GGCATGTACG CGGCCGGAAC GTTCCACGCA GTCGATCCCG ACGGCGGCTC GATCGAATTC
CGTGCCGACG GCACGCTCGG CAGGACGTAC ACCACTGAGG GCAATGCGCT GGTGACCATC
AACACCGTCG GAGGCGGCTT CACCTACACT CCGCCGGCGT TGTGGGACCC CCAGGCTCAA
GGCGCGGCGT TCCGCCACCG CAGCACTGCA GAGGACCCGG AGGAGAGGTT CGACTGGGTC
ACCGTGGACG CGTACTCCGC CGACGGCGTC CCCTACAGCC TCCGGATGGG CATCGAGATC
ATCGACGGTA CGAACGCCGT GCCGGTGTAC ACCGGTGTCA CGGGCCAGAG CACGGACGCG
TTGGGCGTGG TGAAGGGCAA GCTCAACGCG ACAGACTCCG ACGGCGACCC AATTAGGTAC
TACCTGGTCG AATCATCGGT CAACGGTCTG AACGGTAACT CCGCCTACAC CAAGAACGGC
GCCGGCAACG GCGGCATCGT CACCGTCAAC GAGAACGGTG ATTTCACCTA CGTGTCGAGT
GCGACTGCCG GCGCCACGCA GAGCTTCCAG GTGCGGGTCA ACGACGCCCA CCACGGCAAC
ACGATCGTCA CGGTCACCGT GCCGAACACC ACCAGCATCA CCCCGGGCAA CGTCAACACC
TCGACGCCCT ACGTGGTCAC CGGGACGGTG CCGGCTTCGA CCAACAAGCC GGGAGCGTTC
ACCAGCTACA CCCTCGTCGG CGGCACGACG AAGGGCACGG TGACCTCGTT CAACCCGGTC
ACCGGCGCGT TCACCTACAC CTCCAGTGTG GGCCGTGTGT TGGACAACGA CGATGTGATC
ACCGTGATCG GCACCGACGC CGACGGTCGT TCGGTGACGC TGCGCCTGAA CGTGAAGCCG
ACGACGGTCA CCGTCGCTCC AACACTGACG CTGACCACGG CCCCAACGGT CGGCACGCTC
GTCGGCACCA CTCAGACCAG CACGGGCAAG TTCACGTACT TCGATGCCGA CGGCGACGCG
CCGATCTGGC CGACAAGTGT GACCAGCTCG CGCGGCGGCA CCGTCACCGT GGCCGCCGAC
GGTACGTTCA CCTACACGAG CAACCTCACA GTGGCGCAGC GCCACGCGGT CGCCCGGATC
GGGGCCGCGG GCAGCACCTT CAACGGTGTC GCCCTCGCTG CCTGGGAGGA TGCGTTCGCC
ATGACGGTCT CCGATGGGTT CGGGGGCACC GCCACGCAGA CGGTGAAGGT GCCGATCTAC
GCGATCAACG CCAATCCCAC GCTGGGCCTC GGGGGCCTCG CGTGTGGGTT CGGGACCTGC
ACCATCACCA TCACGACGAC CGACCCGGAC GGCGACGATC TCAGCGGCAG CTTGAACACC
TCGAACAACG GCCAAGGTGA TCCCTGGTAC ACCCTCGAGA GAGGGTCGGT CACTATCAAC
GCGGGAAACC AGCACACGAT GAGCTGGACC GGAAACAGCG GCGGCCTTGG CACCCAGCAA
ACGGGTGTCC AGAGATACAC CGTCTATGAC GGCTACTACC GGGTCACCAA CGGCGTCGTC
GACTCCAGCT ACTTCGCGCG GGCGTGGGTT AACTGGAACA ACACCACCAG GACCACCGGG
AACTAG
 
Protein sequence
MAAASPRKSR RTPGRHRKPS NHAHWLRVGA VGFGLAAAII SGQGVATADT DDSSGSGESS 
ATARGKQAAV ADRPSASEQS ESPSSPDSDN DGVGTDDETT DDTDTDTGAE SPDVEVDEQD
QDPQPDTVEE PDEDTSEVTV DESDGSRTHQ KPVVTETTPP AEASPPSTPA DGTDAPQPET
APPATGGNKA DPGTTPAPSS PGGSTGGPST AAPDDEAATP ETEAETEAAY LLNFVKELVK
VFGGATTDGG GTAPPMSWDK VLQGLFLGRK DQETRNATRE LIELANEKTL AELGTSIGWV
PFVGTGLYGA SFVGNLGALI SAVLRGDSAD IADEFRDLGR DVVGMVPVVG APTAAKLYAG
GSDAVGASAQ MSAQLSAQMT AMATATPPPP VGTQAHFLWV LQRAVNRLAG WPGPAGQNFV
DITNRVTDQT LDNADNQLDI LIANAFAGSP AVWLPDLGRV LGLFVLTAIP GYSYTDSLNA
WGSFLNKIMP PYKIADGAGT LDVISNYKIT GAAVVGAATL LRDMLNGIYD PVQWEINIIK
TTTGATVTAS DLNDFNTIMT KVAAAQAGAI LIGGDGGAFD DPTRAWNVTL PTWTEAQVNP
YTITTYVALV AIYKRFQEMA TLTTFTTWTT YDSWHYTNAL GMYAAGTFHA VDPDGGSIEF
RADGTLGRTY TTEGNALVTI NTVGGGFTYT PPALWDPQAQ GAAFRHRSTA EDPEERFDWV
TVDAYSADGV PYSLRMGIEI IDGTNAVPVY TGVTGQSTDA LGVVKGKLNA TDSDGDPIRY
YLVESSVNGL NGNSAYTKNG AGNGGIVTVN ENGDFTYVSS ATAGATQSFQ VRVNDAHHGN
TIVTVTVPNT TSITPGNVNT STPYVVTGTV PASTNKPGAF TSYTLVGGTT KGTVTSFNPV
TGAFTYTSSV GRVLDNDDVI TVIGTDADGR SVTLRLNVKP TTVTVAPTLT LTTAPTVGTL
VGTTQTSTGK FTYFDADGDA PIWPTSVTSS RGGTVTVAAD GTFTYTSNLT VAQRHAVARI
GAAGSTFNGV ALAAWEDAFA MTVSDGFGGT ATQTVKVPIY AINANPTLGL GGLACGFGTC
TITITTTDPD GDDLSGSLNT SNNGQGDPWY TLERGSVTIN AGNQHTMSWT GNSGGLGTQQ
TGVQRYTVYD GYYRVTNGVV DSSYFARAWV NWNNTTRTTG N