Gene Plim_2067 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPlim_2067 
Symbol 
ID9138770 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePlanctomyces limnophilus DSM 3776 
KingdomBacteria 
Replicon accessionNC_014148 
Strand
Start bp2679833 
End bp2680948 
Gene Length1116 bp 
Protein Length371 aa 
Translation table11 
GC content52% 
IMG OID 
ProductNHL repeat containing protein 
Protein accessionYP_003630093 
Protein GI296122315 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCTTGC CTATTTCTCG ACGCGATGCA TTGAAGACCG CCTGTGTGGC CACTGCCGGT 
TTGATTTCAG GGGCTCCGTT CGTGCATGCT CAAAGTAAAG CGGATTCTCA GCCGCTGACC
ACTGGGCAAG GCGATTATCA GTACGCAGTG ATTCACAACT GGGCTCAACT TCCCAGCGAG
TTTACGTGGC AGACAACTCA GGCCGTGGTG GTGGATAAGA ATGGCTTTGT CTATCTGAAT
CACACGGGTG ACTTCAACAA GAAGAATCAC CCGAATGTCT TCGTCTTCGA TCAGGACGGA
AAGTACGTCC GCTCCTTCGG CCAATACTTT CAGGGCGGCG CTCACGGGCT GGAACTTCGT
GAAGAGAATG GTGAGGAGTT TCTTTACTTC TCGAATCCCG ACCCGGTGCA GTCGATTGCT
AAGACCAACC TCAAGGGGGA ACTGATCTGG GAACGGTTTG CACCCATGGA ATCGGGCATT
TATCCCGAGG GTGAGAACAC CTTGCCACAG CGTGTTCGTT CTGGTGAAGT TCCTCGCAAA
GGAGTGGGTG GACCCAATCG CTATAAGCCG ACGAACATTG CCTTTTTGAA GGATGGCGAT
CTGCTGGTGG CCGATGGATA TGGTTCCAAT TACATCCATC GCTATACAAA AGACGGCGAG
TACAAATTGA GCTTTGCTGG TGCCGGCCCC AGTCCGGGAA AGTTCAGCAC CAACCATGGT
CTGGCCTTGG AAGCACGACC CGGGAAAGAA GAAATTCTCT ATGTGACTGA CCGCAGTCGC
AACACGATTC AATGCCTGAC GCCTGAGGGC AAATTTGTTT CACTCATCGA CGGTTTTCAG
AAGCCCTGCC ACGTCGATTT CTACAAGGAT CTGATGCTGG TGCCAGAGCT TCAGGGTCGT
GTGACGCTCC TGGATGGCAA CAACAAGGTT CTGGCTTATC TGTGCGATGA CCATCAGAAC
GTCAATGCCG GTAAGGTGAA TCGAGGTGAC GCCAAGCAGT GGGCACCCGG CAAATTTGTC
CATCCGCACG ATGCCACTTT TGACCATAAT GGCAACATTA TTGTCAGTGA ATGGGTCACA
ACGGGACGGA TTACCTTACT CAAGAAATTG AGTTAA
 
Protein sequence
MALPISRRDA LKTACVATAG LISGAPFVHA QSKADSQPLT TGQGDYQYAV IHNWAQLPSE 
FTWQTTQAVV VDKNGFVYLN HTGDFNKKNH PNVFVFDQDG KYVRSFGQYF QGGAHGLELR
EENGEEFLYF SNPDPVQSIA KTNLKGELIW ERFAPMESGI YPEGENTLPQ RVRSGEVPRK
GVGGPNRYKP TNIAFLKDGD LLVADGYGSN YIHRYTKDGE YKLSFAGAGP SPGKFSTNHG
LALEARPGKE EILYVTDRSR NTIQCLTPEG KFVSLIDGFQ KPCHVDFYKD LMLVPELQGR
VTLLDGNNKV LAYLCDDHQN VNAGKVNRGD AKQWAPGKFV HPHDATFDHN GNIIVSEWVT
TGRITLLKKL S