Gene Mflv_1149 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMflv_1149 
Symbol 
ID4972475 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium gilvum PYR-GCK 
KingdomBacteria 
Replicon accessionNC_009338 
Strand
Start bp1189851 
End bp1191101 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content73% 
IMG OID640455345 
Producthypothetical protein 
Protein accessionYP_001132419 
Protein GI145221741 
COG category[A] RNA processing and modification 
COG ID[COG5178] U5 snRNP spliceosome subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.10821 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.0204141 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGATCACCG CGCTCCCCGA CACCGACCCG ACCGGCGGAC TGACCGCGAA CTCCATCTCC 
CTCGGACCGC GCGGCAACGA CACCTTCGAC CACGCCAAGA GAGGTGACTG CCTCACCTGG
CCGGACCGCA CGCCCGACGC CGCCGAGATC GTCGACTGCG CCGGTGAGCA CCGGTTCGAG
GTCGCCGAGT CGGTGGACAT GGGCACCTTC CCCGGAAGCG AGTACGGACC CGACGCGGCG
CCGCCGTCAG CGGCCCGGAT CCAGCAGATC AGCCAGGAGC AGTGCTCGGC GGCGGTGAAG
CGCTACCTGG GCGCGCGGTT CGACCCCAAC AGCCGGTTCA GCGTCAGCAT GCTGTGGTCC
GGCGACAAGG CCTGGCGCCA GTCCGGCGAG CGCAGGATGC TGTGCGGACT GCAGCTGCCC
GGCCCGAACA ACCAGCAGCT CGCGTTCACC GGACGGGTCG CCGACGTCGA CCAGTCCAAG
GTCTGGCCGG TCGGCACGTG TCTGGGCATC GACCCGGCGA CCAACCAGCC GACCGACATC
CCCGTCGACT GCGCCGCCCC GCACGCGATG GAGGTGACCG GCGCGGTCAA CCTGGCCGCG
AAGTTCCCGG CCGCGCTGCC CCCGGAGCCC GAGCAGGACA CGTTCATCAA GGACGAGTGC
ACGAAGATGA CCGACGCCTA CCTGGCGCCG ATCGAGCTGC GAGAGACGAC GCTGACGCTG
GTGTACAGCA CGGTGTCGTT GCCGAGCTGG GCCGCGGGCA GCCGCCAGGT GTCGTGCAGC
ATCGGGGCGA CCCTCGGCAA CGGCGGCTGG TCGACGCTGC TCAACAGCGC CAAGGGTCCG
CTGATGATCA ACGGACAGCC GCCCGTCCCG CCGCCGGACA TCCCGGAGGA ACGGCTGTCG
CTGCCGCCCA TCCCGGTGCC CGACTCGTCG TCGGGCAGCT CCAGCTCGTC GAGTTCGTCG
GGGTCCTCCA GCTCGTCGGG GTCCTCGGAC TCGTCGGGAT CCTCGGACTC GTCGTCGGGC
AGCAGCCAGA GCGAGGACCA GACGGTCCAC GGGCCGCAAG CCCCTGCGCC CGCGCCGACC
GAGCAGCCGC CGGTCAATCC GGCGCCGCCG CCCCCGGCCG CGGCGCCCGC CGACCAGCTG
CCGCCACCGG GCCCCCTGCT TCTGCCGCCG CCCCCGCCGC CGCCCGCTCC CGTGGCCGGG
CCCCCGGCCG AGCCGCTGCC GCCCGGACCT CCGCCTCCAC CGGGGGTGTA G
 
Protein sequence
MITALPDTDP TGGLTANSIS LGPRGNDTFD HAKRGDCLTW PDRTPDAAEI VDCAGEHRFE 
VAESVDMGTF PGSEYGPDAA PPSAARIQQI SQEQCSAAVK RYLGARFDPN SRFSVSMLWS
GDKAWRQSGE RRMLCGLQLP GPNNQQLAFT GRVADVDQSK VWPVGTCLGI DPATNQPTDI
PVDCAAPHAM EVTGAVNLAA KFPAALPPEP EQDTFIKDEC TKMTDAYLAP IELRETTLTL
VYSTVSLPSW AAGSRQVSCS IGATLGNGGW STLLNSAKGP LMINGQPPVP PPDIPEERLS
LPPIPVPDSS SGSSSSSSSS GSSSSSGSSD SSGSSDSSSG SSQSEDQTVH GPQAPAPAPT
EQPPVNPAPP PPAAAPADQL PPPGPLLLPP PPPPPAPVAG PPAEPLPPGP PPPPGV