Gene Plim_3233 details


Gene Information

Locus tag: Plim_3233
Symbol:
ID: 9139948
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Planctomyces limnophilus DSM 3776
Kingdom: Bacteria
Replicon accession: NC_014148
Strand:
Start bp: 4175191
End bp: 4176651
Gene Length: 1461 bp
Protein Length: 486 aa
Translation table: 11
GC content: 56%
IMG OID:
Product: protein of unknown function DUF900 hydrolase family protein
Protein accession: YP_003631246
Protein GI: 296123468
COG category:
COG ID:
TIGRFAM ID:
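
The start, end, and length fields above can be cross-checked with a little arithmetic. The sketch below assumes 1-based, inclusive coordinates and a terminal stop codon that is not counted in the protein length; neither convention is stated in the record itself, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 4175191
end_bp = 4176651

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends in one stop codon, which is excluded
# from the reported protein length.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # prints: 1461 0 486
```

Under those assumptions the computed values agree with the listed gene length (1461 bp) and protein length (486 aa).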


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.139219
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCCTATT TCGTTTCGAT CTGCCTCACG ACTGCCCGAC GTTGTGGCAG AGTCATGATT 
CTGTGCCTCG TTCTGGCAAT GACTGGCTGC GCAAAAGCTC CGCAGGATGC CAAGACAGCC
ACCGATTCCT CGGATAAGGG ATTTGTCGTC TCTCCGTTGA CGGCTTCGGG TGGCCCGGCT
GGTGCAGCCG ACTTCCCACT TGACGCTGGC TCGACAGGTT CGGCGCCATC CATCTCCGAT
GGCCCGATTC TGCCCGATCT TCCAGCACAG TCGGAACCTG CGATTGCTGG TAGCGAAGCC
CCACCTTTTC CTGCCAACGA GCCAGGTATG GAAACGGTAA CTGTCACCGT GAATGGAAAA
GCTGCGATCG AACCCAAAAC CATCGAGTTT GGAGAATTCG ATCCGTCACA AGCTGCTGCT
TACGAGATCC AACCCCGGCC CGCGATGTCA CCCACAGCCA GGCAATCGAC AGATAGCGGA
TTTACACGCG AAACCATCTA CTTTGCGACC AATCGAAAAC GCACGGGTTC AACCAGTGCC
AAAGAGGCAT ACAGTTCCCA TCGAGGCGAA CTGCAATGGG GAATGTGCCA GGTCAGCATT
CCTTACAAGC ATAAGCCTGG CGAAATGGAA TCGCCCAAAT GGTACCAGTG GAGTGAAGAT
CCCAAGCAGC ACATCATCCT GATGGATCCA CTTGAACAGC TTTCGGAAGC CAACTGGATG
ACTCGGATTC GACAGAAGCT GGCGGGGGCC AGTTCGAGTG AAATTCTGGT CTTTGTACAT
GGCTACAACG TGACATTCGA GAATGCCGTG CGCCGGACGG CTCAGCTTTC TTATGATCTC
AATTTCCCGG GGGCAGCCGT CTGCTTCAGT TGGCCGGCAG GGAACAGCAT GGTTTACACG
ACTGACTGGA CCAATGCGGA ATGGTCGTTG CCGCATTGCC TGCATGTTCT CAAGCAGTTG
GCGTTGTTCT CGAGGGCTGA CAAGATTCAT ATTGTCGCTC ATAGCATGGG CAGCCGTGTG
GTGACTTTTT CACTCAAAGA GCTGCTGCGC GAGTTGCCAG TGCTCGATAA CCAGCCGCTT
TTTAATCAGG TGGTGCTGGC GGCTCCTGAT CTCGATGCTG AGATCTTCCG CACGCAGATC
GCACCGGCGA TTCAGAAGGC CTCCCGCAGG CTGACGATTT ACGCTTCTGA GCAGGATCTG
GCTTTGAAGC TCTCGCAAGG CCTCAATGGT GGCACTCGAC TGGGAACCGC CACACCTGTC
TCATTGACGA CGACGGGCTT CGAGTGGATC GACTGCATCG ATGCCACGGC GATGAGTCAG
GAGCCCATGA TGACCTTACA GCACGCCTAT TACGGCGATT CCCCCCGTAT GATCAGTGAT
CTGCGCCGAG TGCTGGCTGG TGAGAATGCG ACAATGCGTG GCCTGGTCTG TGAAAAGCCG
GGTCTCTTTC AGATTCGCTG A
 
Protein sequence
MSYFVSICLT TARRCGRVMI LCLVLAMTGC AKAPQDAKTA TDSSDKGFVV SPLTASGGPA 
GAADFPLDAG STGSAPSISD GPILPDLPAQ SEPAIAGSEA PPFPANEPGM ETVTVTVNGK
AAIEPKTIEF GEFDPSQAAA YEIQPRPAMS PTARQSTDSG FTRETIYFAT NRKRTGSTSA
KEAYSSHRGE LQWGMCQVSI PYKHKPGEME SPKWYQWSED PKQHIILMDP LEQLSEANWM
TRIRQKLAGA SSSEILVFVH GYNVTFENAV RRTAQLSYDL NFPGAAVCFS WPAGNSMVYT
TDWTNAEWSL PHCLHVLKQL ALFSRADKIH IVAHSMGSRV VTFSLKELLR ELPVLDNQPL
FNQVVLAAPD LDAEIFRTQI APAIQKASRR LTIYASEQDL ALKLSQGLNG GTRLGTATPV
SLTTTGFEWI DCIDATAMSQ EPMMTLQHAY YGDSPRMISD LRRVLAGENA TMRGLVCEKP
GLFQIR
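
As a sanity check, the protein sequence can be compared against a translation of the gene sequence. The following is a minimal sketch, not the database's own method: the helper names (`clean`, `translate`, `gc_percent`) are hypothetical, and it assumes that translation table 11 assigns codons to amino acids exactly as the standard code does, differing only in which start codons are permitted.

```python
# Codon table built from the compact 64-letter listing, bases ordered T, C, A, G.
# Assumption: table 11 shares these codon assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq_text):
    """Remove the spaces and line breaks used to group residues in blocks of 10."""
    return "".join(seq_text.split()).upper()


def translate(dna):
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in a non-empty DNA string."""
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 nt and final 21 nt of the gene sequence listed above.
first_block = clean("ATGTCCTATT TCGTTTCGAT CTGCCTCACG")
last_block = clean("GGTCTCTTTC AGATTCGCTG A")

print(translate(first_block))  # MSYFVSICLT
print(translate(last_block))   # GLFQIR
```

The two printed fragments match the beginning and end of the listed protein sequence. Passing the full cleaned gene sequence to `translate` and `gc_percent` would extend the same check to the whole record.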