Gene Plim_2354 details


Gene Information

Locus tag: Plim_2354
Symbol:
ID: 9139064
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Planctomyces limnophilus DSM 3776
Kingdom: Bacteria
Replicon accession: NC_014148
Strand:
Start bp: 3061133
End bp: 3062252
Gene Length: 1120 bp
Protein Length: 372 aa
Translation table: 11
GC content: 53%
IMG OID:
Product: hypothetical protein
Protein accession: YP_003630379
Protein GI: 296122601
COG category:
COG ID:
TIGRFAM ID:
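
The length fields above can be cross-checked with simple arithmetic. The sketch below is only an illustration, and it rests on two assumptions the record does not state: that Start bp and End bp are 1-based inclusive coordinates, and that the coding region ends in a single stop codon. The variable names are hypothetical.

```python
# Values copied from the Gene Information fields above.
start_bp = 3061133
end_bp = 3062252
protein_length_aa = 372

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: an uninterrupted reading frame needs three bases per residue
# plus one stop codon.
coding_bp = 3 * (protein_length_aa + 1)

# Bases in the gene span that an uninterrupted reading frame would not account for.
extra_bp = gene_length_bp - coding_bp

print(gene_length_bp, coding_bp, extra_bp)  # prints: 1120 1119 1
```

The computed span matches the listed Gene Length of 1120 bp. The one surplus base is what a single +1 shift in reading frame would leave, which would be consistent with the "Is gene spliced: Yes" flag, although the record itself does not state the cause.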


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGATAACG AACTGATTGC ACAATCCAAA GAGCTGACAT CTCGCATCCT CCAGCTCCGA 
GACTCTCTTT GACTACGATC AGTGGCAGGC ACGAGTTGCC GAGATCAACG AAGCCATGTC
TGCACCCGAT TTCTGGGATC ACCAGGAGAA GGCGCAAGCA CTGGTGACCG AGCTGCGCAA
TGTGAATGGG TCTTTGAAAC CACTTCAGGA ACTCATTGAC GGTACTGAAG AACTGGCTGT
CTTGCGGGAA TTTCTGGCGG AAGATGACTC GTCAGAATCA CGTGACGAAA TGGTCGGCCT
GCTCGCTTCC TTGCAGGAAA AATTCCAGCA GGTCGAACTC AAGGCGATGA TGAGTCGTCC
CGAAGACCCC TGTTCGGCTT ATCTGCAGAT TCAAGCGGGC GAAGGGGGAA CAGATGCCTC
CGACTGGGCG GCCATGCTGT TGCGAATGTA CACCCGCTGG GCCGAAGACC GTGGCTTTGA
GACGGAACTG ATTGAAATTT CCGAGGCGGA AGAAGCCGGT ATTCGCAATG CCACACTCGC
CATTCGTGGA GAGTACGTTT ACGGTCTGCT AAGAGGTGAA ACGGGAGTTC ACCGCTTGAT
TCGCATCAGC CCGTTCGATG GAGCCGGTCG TCGTCAGACA TCTTTTGCGG CTGTTGATGT
GATCCCGGAA CCGGATGACA CGATTGATAT CGATATCAAC TGGGAAGATC CTAAGATCAT
CCGCGAAGAT ATCTTTCGCG CCAGTGGTGC GGGTGGTCAG CATGTGAATA AAACATCCTC
GGCCATTCGC TTGACTCACT TGCCCACGAA TACCGTGGTT CAATGCCAGA ACGAACGCAG
TCAGCATAAG AATCGCTCGT GGTGCCGCAA AATGCTGGTG GCCAAGCTGC TGCAGCTTGA
AACTGTCAAA CGGGAAAACG AAGCGGCTCA CAAGCGAGGG CAAAAGTCCA AAATTGGCTT
TGGTGGCGAA ACGATTCGTA ACTATGTCCT CAACCCGGAG CAGTTTGTTA AAGATACTCG
CACCAATCTG AAAGTTGGCA CACCCATGCC CGTCCTCGAC GGACGCATTG ACGAGTTTCT
GGAAGCCTAC TTACGCTGGG ATATGGCCGA ATCGGCGTAA
 
Protein sequence
MDNELIAQSK ELTSRILQLR DSLDYDQWQA RVAEINEAMS APDFWDHQEK AQALVTELRN 
VNGSLKPLQE LIDGTEELAV LREFLAEDDS SESRDEMVGL LASLQEKFQQ VELKAMMSRP
EDPCSAYLQI QAGEGGTDAS DWAAMLLRMY TRWAEDRGFE TELIEISEAE EAGIRNATLA
IRGEYVYGLL RGETGVHRLI RISPFDGAGR RQTSFAAVDV IPEPDDTIDI DINWEDPKII
REDIFRASGA GGQHVNKTSS AIRLTHLPTN TVVQCQNERS QHKNRSWCRK MLVAKLLQLE
TVKRENEAAH KRGQKSKIGF GGETIRNYVL NPEQFVKDTR TNLKVGTPMP VLDGRIDEFL
EAYLRWDMAE SA