Gene Msil_2029 details


Gene Information

Locus tag: Msil_2029
Symbol:
ID: 7094227
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylocella silvestris BL2
Kingdom: Bacteria
Replicon accession: NC_011666
Strand:
Start bp: 2200206
End bp: 2201696
Gene Length: 1491 bp
Protein Length: 496 aa
Translation table: 11
GC content: 60%
IMG OID: 643465353
Product: phytoene desaturase
Protein accession: YP_002362331
Protein GI: 217978184
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG1233] Phytoene dehydrogenase and related proteins
TIGRFAM ID: [TIGR02734] phytoene desaturase
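
The gene and protein lengths listed above follow from the start and end coordinates. The sketch below shows that arithmetic. It assumes the coordinates are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. Both assumptions are consistent with the values in this record but are not stated by it. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(2200206, 2201696)
print(length)                     # 1491
print(protein_length_aa(length))  # 496
```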


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 59
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACGACT GTCCCTCGGC CATCGTTATT GGCGGCGGAT TTGGCGGGAT CGCCGCCGCG 
CTGCGGCTGC GCGCCAGGGG CTGGCGGGTT GTCCTGTTCG ATCGCGCCCC GATGCTCGGC
GGCCGCGCGC AGGTGTTTGA ACGTGAGGGC TTTCGCCACG ACGCCGGACC GACCGTCATC
ACCGCCCCCT TCCTGATCGA TGAATTGTTC GCGCTTTTCG GCAAGCGCCG CGAAGACTTT
GTCGAATTCA TCCCGCTCTC GCCCTGGTAC AGATTCCGAT TCGCCGATGG CGACGTCTTC
GATTATGGCG GCAGCGTCGA AGACACGCTT GCCGAGATCC AGCGCATAGA ACCTTCGGAT
GTGGCAGGCT ATATGAATTT GCTCGAGCAT TCCCGGCGGA TGTTCGACAC GGCGTTTACC
GCTCTTTCCG ACCAGCCGTT TCACGAACTC CCGACCATGT TGCGGCAGGG CGCAGCTCTT
GCCCAGCTGC GGGCCTACAA GACGGTATGG GGCATGGTCT CGCATTATCT GACGAACCCC
AAGCTGCGAC AGGCATTTTC CATTCAGCCT CTGCTTCTTG GCGGCGACCC GTTCGAGACG
ACAAGCATTT ACAGCCTTAT TCATTATCTC GAACGCCAGT GGGGCGTCCT GTTCGCAAAA
GGCGGCGCCG GCGCGATCAT CGCAGCCCTA GCGAAGCTGA TGGCGGATCA GGGCGTCGAC
ATCCGCCTCA ATTCCACCGT CGAGCGCGTG CTGATCGAAA ACGGCGCAGC CCGCGGGGTG
AGGCTTTCGA GCGGCGAAAT CATCGCGTCC GACATCGTCG TCTCCAATGC CGACCCTATG
TATCTTTATC GTAAAATGAT CGACGAATCA GCGCAGCCGC TGTCGGTTCG CCTGAAGAAG
CACGCGAAGC TTTCGATGGG CCTGTTCGTT CTCTATTTCG GGACGCGGCG ACAATACAAG
GATGTGGCGC ACCATACAAT CTGGCTCGGG GCACGCTACA GGGAGCTGCT CGCCGACATT
TTTCAACGAC GCATCCTGCC GGAGGATTTT TCGCTCTATG TGCATCGGCC GACGGCGACG
GATGAAAGCT TTGCGCCGCC CGGATGCGAC AGCTTTTACG TTCTGTGTCC GGTTCCCAAT
CTTCTGGGCA AGATTGACTG GGCGGTTGAA GGGCCGCGGC TTCAGGCGCG CATCGTCAAG
GCGCTCGGCG CGACTCTGCT GCCCGGCCTC GGGGAGGTTA TGACAGCTGA TTTCTTCATG
ACGCCGGAGG ACTTCGCTTC GCGCTATCTG AGCTTCGCCG GGACAGGGTT CTCGATCGCG
CCGCTGTTTT CGCAGTCCGC CTGGTTTCGC TTTCACAATC GAGCCGAGGG GGTCGCAAAT
CTTTATCTTG TCGGCGCGGG AACGCATCCA GGCGCCGGAA TTCCCGGAGT GCTCTGTTCG
GCGAAAGTGC TCGACCGCCT GATTCCGCCA GCCGCGGCTT TCGTGCGATA A
 
Protein sequence
MNDCPSAIVI GGGFGGIAAA LRLRARGWRV VLFDRAPMLG GRAQVFEREG FRHDAGPTVI 
TAPFLIDELF ALFGKRREDF VEFIPLSPWY RFRFADGDVF DYGGSVEDTL AEIQRIEPSD
VAGYMNLLEH SRRMFDTAFT ALSDQPFHEL PTMLRQGAAL AQLRAYKTVW GMVSHYLTNP
KLRQAFSIQP LLLGGDPFET TSIYSLIHYL ERQWGVLFAK GGAGAIIAAL AKLMADQGVD
IRLNSTVERV LIENGAARGV RLSSGEIIAS DIVVSNADPM YLYRKMIDES AQPLSVRLKK
HAKLSMGLFV LYFGTRRQYK DVAHHTIWLG ARYRELLADI FQRRILPEDF SLYVHRPTAT
DESFAPPGCD SFYVLCPVPN LLGKIDWAVE GPRLQARIVK ALGATLLPGL GEVMTADFFM
TPEDFASRYL SFAGTGFSIA PLFSQSAWFR FHNRAEGVAN LYLVGAGTHP GAGIPGVLCS
AKVLDRLIPP AAAFVR
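
The gene and protein sequences above can be checked against each other by translating the DNA. The following is a minimal sketch, not the method used to build this record. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as start codons. Because this gene starts with ATG, alternative start codons are not handled here. The helper names (clean, translate, gc_fraction) are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code; table 11
# uses the same codon-to-amino-acid assignments).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence, copied with their grouping spaces.
first_bases = "ATGAACGACT GTCCCTCGGC CATCGTTATT"
print(translate(first_bases))  # MNDCPSAIVI
```

Passing the full gene sequence to translate should reproduce the protein sequence listed above, and gc_fraction on the same input gives the value that the GC content field reports as a rounded percentage.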