Gene Plim_1243 details

Gene Information

Locus tag: Plim_1243
Symbol:
ID: 9137937
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Planctomyces limnophilus DSM 3776
Kingdom: Bacteria
Replicon accession: NC_014148
Strand:
Start bp: 1593951
End bp: 1595312
Gene Length: 1362 bp
Protein Length: 453 aa
Translation table: 11
GC content: 53%
IMG OID:
Product: HtrA2 peptidase
Protein accession: YP_003629276
Protein GI: 296121498
COG category:
COG ID:
TIGRFAM ID:
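
The coordinates and lengths in this record can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the gene is a stop codon that is not counted in the protein length (the gene sequence in this record does end in TAG). The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1593951
end_bp = 1595312
gene_length_bp = 1362
protein_length_aa = 453

# Assumption: coordinates are 1-based and inclusive, so the span
# covers both the start and the end position.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon, so it is excluded
# from the protein length.
codon_count = span_bp // 3
expected_protein_aa = codon_count - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("span is a whole number of codons:", span_bp % 3 == 0)
print("protein length consistent:", expected_protein_aa == protein_length_aa)
```

Under these assumptions, all three checks print True for the values listed here.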


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATTAAGT GGCTTATCAC GCCAGCGATT TACGGAGTTC TCTGCCTGAC AGCAGTGCTG 
TCCAGTGCCT CGGAGCTTCG TGAAACCCCA GCCGTCCGGG CCTACAAAAG GGCATCGGCT
TCCGTTGTGA ACATTCACAC TGAGAAGTCA GCTCAGGAAC GGGACTCTGT CTTTGCCTCT
AGCCGAGGTC GCAAGATTAA CGGCATGGGG ACCGGCATCG TCATTGACGA ACGCGGGTAT
ATCGTCACCA ATCACCATGT GGTGGCTGAT GTCGAACTGA TTCGTGCGAC ATTCGAAGAT
GGCAGCGATT ACGATGCCCG CGTCATCGGC GTCGATAAAG AACAGGATCT GGCGGTCATC
AAGGTGGATG GCACCAAGAC ATTCAAAGTC GCTCCCTTCG GAACATCGAG CGATATCTAC
CTTGCTGAAC GCGTACTGGC GATTGGTAAC GCTTATGGTT ATCGCCACAC CGTGACAGAA
GGCATTGTCA GTGCTCTGGG CCGCGATGTG GAAGTCAATG AGACGCAATC ATACCGCAAT
TTGATTCAGA CCGACGCCAG CATCAATCCG GGCAACAGCG GTGGCCCGCT GATCAATATG
GACGGTGATG TGATCGGTGT GAATGTTGCT ATCCGGGCCG GGGCGCAGCG AATTGGCTTC
GCGATTCCCA TCGACGACGC TCGCAAGGTG GTGGCTCGAC TGATCTCTGT TGAGCAGATG
GGTTTGGGTT ATCACGGCGC GATTCTCAGA GATCTGAAAA CGGCAACACA AAAGCTGTTG
ATTATTGAAA ATGTGCTCTC CGACAGCCCT GCACAACGCG CCGGTTTGAA GGCTGGTGAT
GTGGTTCTCA AGGCCGGTTC ACTGGAAGTA AGTGATTCTG TCGATTTCGA ACGATCGCTC
CTGGGCCGTA AGCCTGGCGA TAATCTGGAT CTGGTTGTTC GGCGTAATGA TCGGGATGAA
AAACTGAATT TTGCTCTCGG GCAATCCAAT ATCTCTCTTG TGCAGAATCA GGCATTTCGT
CCTGCATCCA CCGGAATCAA TGAGACGGAA GCTCAACGGT TCTGGCAGAT TCTGGGCTTG
AAACTGGCAC CGATTGCTGC TGATCAAAAG CTGCTGACAG GTACACGTTA CCGTGGTGGA
TTGCGAGTCG TCGATGTCCG CCCGGACAGC CCGGCTGCTT CGAACGGGAT CACCAAGGGC
GATATTCTCG TCGGTCTGCA TGATTGGGAA ACACTCTCTG TCGAGAACGT GACCTGGATC
GTCAACAAAT CGAACGAAAT CAAGCTGAAC CCAATCAAGT TTTACATTGT ACGCGGTCAG
GAAACATTGT TCGGCCACCT GCAGACCGCC AGCCGCCAGT AG
 
Protein sequence
MIKWLITPAI YGVLCLTAVL SSASELRETP AVRAYKRASA SVVNIHTEKS AQERDSVFAS 
SRGRKINGMG TGIVIDERGY IVTNHHVVAD VELIRATFED GSDYDARVIG VDKEQDLAVI
KVDGTKTFKV APFGTSSDIY LAERVLAIGN AYGYRHTVTE GIVSALGRDV EVNETQSYRN
LIQTDASINP GNSGGPLINM DGDVIGVNVA IRAGAQRIGF AIPIDDARKV VARLISVEQM
GLGYHGAILR DLKTATQKLL IIENVLSDSP AQRAGLKAGD VVLKAGSLEV SDSVDFERSL
LGRKPGDNLD LVVRRNDRDE KLNFALGQSN ISLVQNQAFR PASTGINETE AQRFWQILGL
KLAPIAADQK LLTGTRYRGG LRVVDVRPDS PAASNGITKG DILVGLHDWE TLSVENVTWI
VNKSNEIKLN PIKFYIVRGQ ETLFGHLQTA SRQ
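
The sequences above are laid out in space-separated groups of ten residues, so they need to be normalized before any length or composition check. The following is a minimal sketch of that step. The helper names `normalize_sequence` and `gc_percent` are hypothetical, and only the first line of the gene sequence is used as sample input, so the printed GC value describes that line alone and not the 53% reported for the whole gene.

```python
def normalize_sequence(text: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block."""
    return "".join(text.split()).upper()


def gc_percent(text: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = normalize_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence from this record (six groups of ten bases).
first_line = (
    "ATGATTAAGT GGCTTATCAC GCCAGCGATT "
    "TACGGAGTTC TCTGCCTGAC AGCAGTGCTG"
)

print(len(normalize_sequence(first_line)))
print(gc_percent(first_line))
```

Pasting the full gene sequence block into `normalize_sequence` gives a string whose length can be compared with the Gene Length field, and `gc_percent` on the same input can be compared with the GC content field.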