Gene Plim_4030 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPlim_4030 
Symbol 
ID9140750 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePlanctomyces limnophilus DSM 3776 
KingdomBacteria 
Replicon accessionNC_014148 
Strand
Start bp5168396 
End bp5170054 
Gene Length1659 bp 
Protein Length552 aa 
Translation table11 
GC content57% 
IMG OID 
Productprotein of unknown function Met10 
Protein accessionYP_003632040 
Protein GI296124262 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAGAATT CACCCAACCC ACGCCGATCT GCCGCCAAAC GCCCTGCTCA GGGCCAGTCC 
TACCCGAAGT CTTCCCGTCG TCCTGCGACA AATCGACCGC CAGCGGGAGG TGAAGGTTCG
GAGACGAGGG GAGAATCCGC ACGCAATCGT CGTGGAGACA ATACTCCGCA GGGCGACAAT
CGCGCAGCAG GTGGCCGTCG CTATGGCACC CCATCTCGTG GGCCGGATGC GCGATCGCGC
GGGCCTGCAC CGCGAAATCC TGCGACGGGC GAGTTTCAAA CTCATGAAAC CAGCAATGCC
AAATCTGTCC GCCAGCGTCG GGCACAGATC GATTCGGGAA GTGAAGTTCA GCGCCGACTG
GCGTTGTACT CGCCCGAGAT GTTGAGCCCG CGTCCACTGA CCGCAGACAA GATTCCGGTC
ATTGTGGCGC GCAGCCCCAG CAGGCATCCT TACTACTTTC GCAAAATGCT CGTCAGCGGG
ATTAAGGCCG CTCAGCCAGG CGATCTCGTG AAGGTCGTGT TAGAAGAATC GCAGCAGACA
CTCGGCTACG GGCTCTACAA TCCGCGGGCC GAGATGACTG TCCGCATGCT GACGCGGGGT
GATCAGATTC CTGATGAAGC GTGGTGGAAA TCCAAGCTCG AGGCGGCCGT CAAATTCCGT
ACAGAGACCT TGGGACTCGA ACAGCAGGGC AATGTCTATC GACTGGTGCA TGCCGAAGGA
GATGGATTGC CGGGACTGAT GGTTGATCGT TATGGCGATG TTCTTTCCGT CGAGGCCTTC
AACCTGGGGA TGTATCAGCG GGCCGAAAGT ATTCTGGATC TGCTGGCGGC ATGCACCGGA
GCGAAGCATG GCATTTTGAG GCCCGGCGCT TATGCCGCTC AACTGGAAGG CTTTGATGCT
GACCCGTTTG GCTCGGAGAA TGCTCCTGAG AGTGTGCAGG TGGTCGAGAA TGGCGTGAAG
TACGAAGTTC AGTTTGAAGA TGGCCACAAG ACGGGCTTTT TCTGTGATCA GCGGCAGAAT
CGCGCGCGAG TGGCTGAGTT GGCAGCCGGG CAGCGTGTGC TGGATTTATG TTGCTACTCG
GGCGGATTTG CGCTCTCGGC GGCTGTTTCC GGTGCGAAAA GCGTACACGG CGTTGATCTG
GATGAAGCGG CCATTGCTGT GGCCAAAAAG AATGCCAAGC TCAACAAAGT CCAGATTGAA
TGGGCGCATG CCGACATTTT CGCCTGGATG CGTGAAGCTC AGAAACAGGG CCAGCAGTGG
GATATCGTGG TTCTCGACCC GCCGAAACTC ATTCGCACAC GTGATGATTA TGAAGATGGT
CGCAAGAAGT ATTTCGATAT GAATCGACTG GCCGCCAGTC TTGTGTCGCC AGGTGGTATG
TTGATTACCT GCAGTTGTTC GGGGTTGTTA TCTTCTGCCG ATTTTACACG TGTCACGGGC
TATGCGATCG ACAATGCCGG TCGATCGGCC CGACTGTGCG AGCAGACGGG TGCCGGGCCC
GATCATCCCG TGCATGTGCG CTGCCCGGAA TCGCTCTATT TGAAGGTGAA CTGGTATTGG
TTCGATGAAG CTCCCGTGAC GAAGTCTACG CACTTGCCGG AAGGCCTCAT GGAAGACTTC
ACAGACGAGA GCGATGCTTT GGCAGACTCA GACGAATAG
 
Protein sequence
MENSPNPRRS AAKRPAQGQS YPKSSRRPAT NRPPAGGEGS ETRGESARNR RGDNTPQGDN 
RAAGGRRYGT PSRGPDARSR GPAPRNPATG EFQTHETSNA KSVRQRRAQI DSGSEVQRRL
ALYSPEMLSP RPLTADKIPV IVARSPSRHP YYFRKMLVSG IKAAQPGDLV KVVLEESQQT
LGYGLYNPRA EMTVRMLTRG DQIPDEAWWK SKLEAAVKFR TETLGLEQQG NVYRLVHAEG
DGLPGLMVDR YGDVLSVEAF NLGMYQRAES ILDLLAACTG AKHGILRPGA YAAQLEGFDA
DPFGSENAPE SVQVVENGVK YEVQFEDGHK TGFFCDQRQN RARVAELAAG QRVLDLCCYS
GGFALSAAVS GAKSVHGVDL DEAAIAVAKK NAKLNKVQIE WAHADIFAWM REAQKQGQQW
DIVVLDPPKL IRTRDDYEDG RKKYFDMNRL AASLVSPGGM LITCSCSGLL SSADFTRVTG
YAIDNAGRSA RLCEQTGAGP DHPVHVRCPE SLYLKVNWYW FDEAPVTKST HLPEGLMEDF
TDESDALADS DE