Gene Plim_4167 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPlim_4167 
Symbol 
ID9140887 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePlanctomyces limnophilus DSM 3776 
KingdomBacteria 
Replicon accessionNC_014148 
Strand
Start bp5332385 
End bp5334271 
Gene Length1887 bp 
Protein Length628 aa 
Translation table11 
GC content53% 
IMG OID 
Productpolysaccharide biosynthesis protein CapD 
Protein accessionYP_003632176 
Protein GI296124398 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCATTTC GACAACTGAT CATTCCACCG ATCTACATTA TCCTGTACGG ATTCACATAT 
CTTCTGGCCT ACTCTCTGCG CTTCGATTTC GAAGTTCCCA GTGAGGTCTG GGGGAGATTT
CTAGCCACTT TTCCACTGGT GATTGCCACC AAGTCGCTGG TGAATCTATT GACTCGTCAA
TGGCGCAGGA AGCACCGTTA CACCTCGCTG GTTGATGTCA TCTACGTCAC CGGAGATGCC
TTTTTTGCAG CGACATTACT TTTGGCCATC AATGCCTTTC TGCCCTCGGG AATCGTGATT
CCGCGATCGA TTGTGTTGAT CGATCTGATG CTCACAGTTC TGGCAATTGC CGGCCTGCGT
TCGATGATTC GCAGCTATTG TGAGGTGATT CATCCTAAGC TCCACCGCAG GCTGGGTGGT
ATGTCGCAAC TGACTCCACG CCGGGCTTTG ATCTACGGTG CAGATGCCAG TGCAGTGGCA
ATTTTCCGCG CACTCAAAAG CGGCAACGAA GAGTATCGCA TCTGTGGTTT TATCGATCCT
GAAGGTTTCT CTCAGTCGAG CATTATTGGC GAAGCCGAAG TCTTCAGTGG CCAGATCGAT
CTTGCAAAAA TCGCTAAAAA AGTCCGAGCG GAAATGTTGC TCATACCGGC ATCGACGCCC
GGGCGAATCG TGCGAGAGTT ACTCGTGCAA TGTGATGACC AACATCTTCT GGCCCATGTG
ATCCCGGGTG TCGATGAAAT CGTCAATGGA CGCATTCGGC TGGCGACGCG GGAAGTGACC
ATCTCCGACC TGTTACGCCG CGAACCGACA AAGCTCGATT TTGAAAGTAT GCGGAGTTAC
ATCTCTCGTC GCCGTGTGCT GGTCACGGGA GCTGCGGGAA GTATTGGCTC AGAGCTGTGC
CGACAAATTC TGGCTCTGAA TCCGGAAAGT CTGATTCTGC TCGATCAATC CGAGCCTGGC
ATTTTTGCCA TGGAACAGGA GTTCCAGACG CGTACGACGG GTGGCACTCG ACTGGTTTAC
GAGATCGCCG ATATGCGCGA TCAACCCACG CTCGAACAGA TCTTCGATAC CTACAAGCCC
CAACTGGTTT TCCATGCAGC GGCATATAAG CATGTTCCTC TTATGGAAGC CAATCCGCAG
GAAGCGATCC GTAACAACAT CTTCGGTACA AAGGCACTGG TCGATGCTGC TGATCAATTT
GGCGTGGATC GCTTTGTGTT GATTTCGACC GATAAAGCTG TGCGTCCCAC CAACATCATG
GGCTCTACAA AACTCTTTGC CGAGAAATAT CTGCAGGCGA CTGCCCAGAA ATCCAAAACC
GAATTCATGA CGGTTCGCTT CGGGAATGTG CTCAACTCGG CTGGCAGTGT GGTACCGACA
TTCCGCCGAC AGATTCTCGA AGGTGGCCCG ATCACTGTCA CGCATCCCGA GATGGTGCGG
TTCTTCATGA CGATTCCTGA AGCCGTTCAA CTCGTGCTGC AGGCAGGGGC CATCGGACAA
ACCGGCGGCG TCATGATTCT GGATATGGGC GACCCGGTGA AAATTCTCGA TCTGGCCCGC
GATATGATCT ATCTTTCCGG GCTCAAGTAC CCGGACGATA TCGACATTGT CTTTACTGGT
CTTAGACCCG GCGAAAAGCT CTACGAAGAA CTCTTCTATG AATCCGAAGT CTCGGCCGAG
AAGATCCACG AAAAGATTTT CATGGCCCAC CGGGCTCCGA TTTCACCTCG TCTGGTGAAG
GAAGCTCTCG CTCGTCTGCA GTCAGCCGTC GAACTTTCCC GGGAAGCTGC CGCTGCGACT
TTGCGTGAGA TCACGGCTGA GTTCGTGGCA ATTGATGAGG GGACATCTTC CGAGCACATC
AGCCAGCCCA CTCGCAAAGC GGCTTAA
 
Protein sequence
MPFRQLIIPP IYIILYGFTY LLAYSLRFDF EVPSEVWGRF LATFPLVIAT KSLVNLLTRQ 
WRRKHRYTSL VDVIYVTGDA FFAATLLLAI NAFLPSGIVI PRSIVLIDLM LTVLAIAGLR
SMIRSYCEVI HPKLHRRLGG MSQLTPRRAL IYGADASAVA IFRALKSGNE EYRICGFIDP
EGFSQSSIIG EAEVFSGQID LAKIAKKVRA EMLLIPASTP GRIVRELLVQ CDDQHLLAHV
IPGVDEIVNG RIRLATREVT ISDLLRREPT KLDFESMRSY ISRRRVLVTG AAGSIGSELC
RQILALNPES LILLDQSEPG IFAMEQEFQT RTTGGTRLVY EIADMRDQPT LEQIFDTYKP
QLVFHAAAYK HVPLMEANPQ EAIRNNIFGT KALVDAADQF GVDRFVLIST DKAVRPTNIM
GSTKLFAEKY LQATAQKSKT EFMTVRFGNV LNSAGSVVPT FRRQILEGGP ITVTHPEMVR
FFMTIPEAVQ LVLQAGAIGQ TGGVMILDMG DPVKILDLAR DMIYLSGLKY PDDIDIVFTG
LRPGEKLYEE LFYESEVSAE KIHEKIFMAH RAPISPRLVK EALARLQSAV ELSREAAAAT
LREITAEFVA IDEGTSSEHI SQPTRKAA