Gene Plim_3168 details


Gene Information

Locus tag: Plim_3168
Symbol:
ID: 9139882
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Planctomyces limnophilus DSM 3776
Kingdom: Bacteria
Replicon accession: NC_014148
Strand:
Start bp: 4100337
End bp: 4102322
Gene Length: 1986 bp
Protein Length: 661 aa
Translation table: 11
GC content: 55%
IMG OID:
Product: Spore coat protein CotH
Protein accession: YP_003631182
Protein GI: 296123404
COG category:
COG ID:
TIGRFAM ID:
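
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
# Values copied from the record above.
START_BP = 4100337
END_BP = 4102322
GENE_LENGTH_BP = 1986
PROTEIN_LENGTH_AA = 661


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one stop codon (assumes an unspliced CDS)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)               # True
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)  # True
```

Under these assumptions both checks agree with the record: 4102322 - 4100337 + 1 = 1986 bp, and 1986 / 3 - 1 = 661 aa.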


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.308168
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTTGTCG GGAGCGTTCT ATTATTGGTG ATTGCCATCG CGATTGAATC GGTCTGGCCA 
CAGACGGGTG ATCGCGCTCC GGAAAATGGC CCACCCCAAA ATGGCCCCCG AGAAAATGGC
CCAGGGGGCG GCTTTCCACC CTTCGGCCCT CCGGGCGGAT TCCCTGGCTT TGGCGGACCA
CCCGGCTTCG GCGGCCCGCC CGGAGGTGGA CAGGAACGCA AGCTGCTCTC GAAATTCGAT
GCTGACAAAG ACGGCAAGCT GAATCTCGCA GAGCGACAAC TGGCTCGAAA AGAATCCGCT
CAGGGAAGTG CTGGCTTCGG CGGTCCCAGA GGGCCTCGCG GCGGTATGCC TGGCATGGGA
GCCAATCGTA CGCCACAAGC AGGGAAGAAA ATCCTGCCCG AGAATGTGAC GTCAGCCGGT
GATACTGATC TGTACGATCC GTCGATCGTC CGGACAATAT TTCTGAACTT TGAAGGCAAA
GACTGGGAAA CAGAACTCTC GGACTTTCAC AACACGGATG TCGAAGTTCC CGCCATGATG
CAGGTTGATG GTAAAGACTA CCCCGACGTC GGCGTCAGCT TCCGCGGCAT GTCATCCTAC
GGTATGGTGC CAGCCGGCTT TAAGCGGTCA TTCAATGTTT CCATCGATGC CTTTAACGAC
CAGCAAAAGC TCGGCGGCTA CAAAACGCTG AATTTACTTA ACTGCAATGG CGACACTTCA
TTTTTAAGAG GTTTTGTCTA CTCTCAGATC GCCACGGAAA TGATCCCCGT CCCTCGCGTG
AACTTTGTTC GTGTCGTTGT AAATCACGAA GACTGGGGTG TCTTTGCAAA CGTCGAACAA
TTCAACAAAG ATTTTATCAA AAGACACTTT GAGAACAGCA ATGGCTATCG GTGGAAGGTT
CCAGGAAGTC CTATGGGTCG CGGTGGGCTG GAGTATTTAG GGGATGATAC CAACGCCTAC
AAGCGAATTT ACGAGATCAA AAGTAAAGAT ACTCCTGAGG CCTGGGAGCG ATTGATTTCT
TTATGCCGCA TTCTGAATGA GACACCTGCG GAGCAACTGG TCGAAAAGCT GGAGCCAGTC
CTCGATATTG ATGAGACCCT GACGTTTCTG GCGCTGGATG TCGCGCTCTG CAATAGTGAT
GGCTACTGGA CTCGCGCGAG TGATTACAGT CTCTACTGCA CACCCGAAGG GAAATTTACA
CTGGTTCCGC ACGACTTCAA TGAGATCTTT CAATCGGGTG GCCCTGGCGG CCCACCAGGT
GGCGGACCGC CGGGTGGATT TGGGCCCCCA TCATTTGGTT TTCCTCCATT TGGTCCACCA
CCCGAAGGCC AACCCCCATT CGGACCGCCG GGTGGTGCAC CCAATGGATT CGGGCCACCT
CCCAATGGGA ATAGCACTCC TCCTCAGGGA CGAGTTGCCG GTAATCCCAA TGGCCCTCCT
CAAGGGCCGG GTGGCGGGCA AAATCCGCGG GGTGGCCCAA GACGAGGCCC CGGTGGTGGT
GGGCCTGGTG GCGGTGGTCC CGGTGGCGGT GGGCCAGGTC ATGGTGGGCC GACACTCGAT
CCTCTGGTTG GGCTCAACGA TTCGACAAAG CCACTGCGCA GTAAACTCCT TGCTGTTCCA
GAACTCAAGG CTCGCTATCT GAAGTATGTC GGACAGATTG CCGATCAGTA CCTGGCAGCC
GAATTCCTCA AACCTCGGAT GCAGCAGGAG TTTGAACTGA TTTCACCACT GGTGGCTCAG
GATCAGAAGA AGCTCTTCAC CACAGCCGAC TTTGTGCGCG AGTCGAAGTT TATTGAGATT
CAGAACTCCT CAGAAAATGC TCGCTCGACG CTGTGGGATC AGATTCAGAA ACGACGGGAG
TTTCTGATGA AACACGCGGA AGTCCGAGCC GCCCTTGGCA AGCAGAACAC CGATGGACGA
ACATCGGCGA TCAAGAATCA GCGATCCTCC AACCGACAAC CGGCTCCCTT GTCGTCAGCC
CGTTAA
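
The gene sequence is printed in blocks of ten bases separated by spaces, so it needs to be joined before any analysis. The sketch below shows one way to do that and to compute a GC fraction for comparison with the GC content reported in the record. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


if __name__ == "__main__":
    # First line of the gene sequence above; paste the full listing to
    # compute the value for the whole gene.
    first_line = (
        "ATGGTTGTCG GGAGCGTTCT ATTATTGGTG "
        "ATTGCCATCG CGATTGAATC GGTCTGGCCA"
    )
    seq = clean_sequence(first_line)
    print(len(seq))
    print(round(gc_fraction(seq), 3))
```

Running `gc_fraction` on the complete 1986 bp sequence gives a value to compare with the 55% listed under Gene Information.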
 
Protein sequence
MVVGSVLLLV IAIAIESVWP QTGDRAPENG PPQNGPRENG PGGGFPPFGP PGGFPGFGGP 
PGFGGPPGGG QERKLLSKFD ADKDGKLNLA ERQLARKESA QGSAGFGGPR GPRGGMPGMG
ANRTPQAGKK ILPENVTSAG DTDLYDPSIV RTIFLNFEGK DWETELSDFH NTDVEVPAMM
QVDGKDYPDV GVSFRGMSSY GMVPAGFKRS FNVSIDAFND QQKLGGYKTL NLLNCNGDTS
FLRGFVYSQI ATEMIPVPRV NFVRVVVNHE DWGVFANVEQ FNKDFIKRHF ENSNGYRWKV
PGSPMGRGGL EYLGDDTNAY KRIYEIKSKD TPEAWERLIS LCRILNETPA EQLVEKLEPV
LDIDETLTFL ALDVALCNSD GYWTRASDYS LYCTPEGKFT LVPHDFNEIF QSGGPGGPPG
GGPPGGFGPP SFGFPPFGPP PEGQPPFGPP GGAPNGFGPP PNGNSTPPQG RVAGNPNGPP
QGPGGGQNPR GGPRRGPGGG GPGGGGPGGG GPGHGGPTLD PLVGLNDSTK PLRSKLLAVP
ELKARYLKYV GQIADQYLAA EFLKPRMQQE FELISPLVAQ DQKKLFTTAD FVRESKFIEI
QNSSENARST LWDQIQKRRE FLMKHAEVRA ALGKQNTDGR TSAIKNQRSS NRQPAPLSSA
R
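
The protein sequence can be reproduced from the gene sequence by codon translation. The sketch below is a minimal, assumption-laden version: it uses the codon assignments of the standard genetic code, which translation table 11 shares for elongation, and it does not handle alternative start codons (this gene starts with ATG, so that case does not arise here). The names `CODON_TABLE` and `translate_cds` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()  # tolerate the blocked layout
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    print(translate_cds("ATGGTTGTCG GGAGCGTTCT ATTATTGGTG"))  # MVVGSVLLLV
    print(translate_cds("CGTTAA"))                            # R
```

The two sample inputs are the first thirty bases and the last six bases of the gene sequence. Their translations match the first ten residues and the final residue of the protein sequence listed above.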