Gene Plim_0154 details


Gene Information

Locus tag: Plim_0154
Symbol:
ID: 9136808
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Planctomyces limnophilus DSM 3776
Kingdom: Bacteria
Replicon accession: NC_014148
Strand:
Start bp: 205231
End bp: 206832
Gene Length: 1602 bp
Protein Length: 533 aa
Translation table: 11
GC content: 53%
IMG OID:
Product: HTTM domain protein
Protein accession: YP_003628205
Protein GI: 296120427
COG category:
COG ID:
TIGRFAM ID:
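
The start, end, gene length and protein length values in this table are related by simple arithmetic. The sketch below checks that relationship in Python. It assumes the coordinates are 1-based and inclusive, and that the CDS is complete and ends in a single stop codon; neither assumption is stated in the record. The function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 205231
END_BP = 206832
GENE_LENGTH_BP = 1602
PROTEIN_LENGTH_AA = 533


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one, assuming the CDS ends in one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print("span length (bp):", span_length(START_BP, END_BP))
    print("expected protein length (aa):", expected_protein_length(GENE_LENGTH_BP))
```

Under these assumptions, both computed values agree with the Gene Length and Protein Length fields of the record.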


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCCCAGC TTTCCCATGA CAGTGCGATC GTTAACAGCA CACGTTCGAT CAGTGAACTT 
CTTCGGCAAC AGAAGGAGCA GTTGAATGAG TTCTTCTTTG CCAAAGAATC GCCAATTGGC
GTGGCACTGA CGAGAATCGT CATCTGTGCC ACAGTTTTCA TCGTCATGCT CGATCGATGG
AAGTACGTTC GCGAGATCTA CTCAACGGAT GGTGCTCCTG CGCAGATCAG TGTGAACTTC
GGATTTGGAG AACTCTTTCC TGTTTTTTCA GGGAGTGTGG TTGCAGCGCT CTTTGCCATC
ATGCTCTTTG CCCTGCTGAC GGCCATGGTG GGATGGAAGA CCCGCCTGTC GCTGATTGTC
GCCAATCTGC TGTTCATCTA TTTCTGCAAT ATCGACTACG TCACAACCTT GACCAAGTAT
TCGGTCATTG CAACCCATAT CCTGCTGCTA CTCACGCTTT CCCGTTGTGG TGATGTTTTC
TCTGTCGATG CGTGGCTCAA ACGCACTGCA CCAGCGAACC CCTGGCTGGG CCGGACCATT
GAAGATCTCC CTCAAGGTTA CGCCTGGCCC AGACGCTGCA TCCAAATCAT GATTGGTACG
GTCTACTTTG GTGCCGCCAT TACCAAAATT CACACGCCGA CATTTTTCTC TGGTGATCAA
CTCCAGTGGT GGATGTTGAC CGAACTCAAT TATGAGCATC CGGTCGGTGC CTTCATCAGC
ATGTATCCGG CTGTCATTGT CGTGATGTGC TACATTGCGG TGATCTGGGA AATCATGTTC
ATCGTGCTGG CGTGGCGTGG TGTCCCGCGG ATGATTTTCC TGACGCTGGG CGTAATCTTC
CATGCGGCCA CATTCTTCAC GCTGGGACTG CTTTCCTTCC CGCCGGTCTG CTTTGCCTGT
TATCTGGCCT TCATGAACGA TAACGATGCA CGATGGCTCG CCTCGCATGG CCGCTGGATC
ATGCGCAAAT TCCACCTGCG AAACTGGATA GCTCCGCTAA ATGCCACTGC GGCCATCAAG
GCGTTTTCGA TTCAATCACC ACAAGTCCAA ACATCTCCAG CAACGGGTTA TGCCCGGGTT
GTTCGTCAGA CAGGTCTTTG GGGAGCCTGC TGTGCCTGCC TCGCACTGAT GGGGGTTGCC
ACGGAATATC AGGTCGATCG TTATGGCGTT CGTCGTCCTG AGGGGCCGAT GGTTCTGGAG
CCTATGGATC AGGCTGTGGC CAAGAAGTTT CTATCTCCGG CACCTAAGTT TCGCGAAGTG
GATAAGTTTT TTGCGATCGA CGTCGGCACT CTTCTGGTGG CCGATCAGCT TGCGATCCGC
AAGCAGTACT ATCAGATTGG CGAAACCATG ATCGTCCAGT GCCAGCTTCT GCCACCCCAC
GAAGATATGT ATCTCGAATG CCTGATCCTT AATGAAGAGG GGCAGATCGA AGGGGTGCAG
GAACTGGTGG CCACTCGTGA AATGAACCGT GCGAACTTCA ACTGGCCGCT CTGTGAAAAT
GTCCAGAGTG GTCGCCATCA GATCGTGATT CGCTCGGCAG GCCAGGAGAT TGCCCGCCGA
ACATTCTTCG TCAATGGCGA GACTTGTGAC GTGAAAAAGT AA
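
The GC content field in the record can in principle be recomputed from the gene sequence above. The following is a minimal sketch of such a calculation, not the method the database used. It assumes the sequence is pasted in as text with the same space-separated blocks of ten bases, and the helper names `clean` and `gc_percent` are hypothetical.

```python
def clean(seq: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and upper-case it."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases among all bases in the sequence."""
    bases = clean(seq)
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


if __name__ == "__main__":
    # First row of the gene sequence, used only as sample input.
    first_row = "ATGTCCCAGC TTTCCCATGA CAGTGCGATC GTTAACAGCA CACGTTCGAT CAGTGAACTT"
    print(f"GC content of first row: {gc_percent(first_row):.1f}%")
```

To compare against the reported value, pass the full 1602 bp sequence rather than a single row.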
 
Protein sequence
MSQLSHDSAI VNSTRSISEL LRQQKEQLNE FFFAKESPIG VALTRIVICA TVFIVMLDRW 
KYVREIYSTD GAPAQISVNF GFGELFPVFS GSVVAALFAI MLFALLTAMV GWKTRLSLIV
ANLLFIYFCN IDYVTTLTKY SVIATHILLL LTLSRCGDVF SVDAWLKRTA PANPWLGRTI
EDLPQGYAWP RRCIQIMIGT VYFGAAITKI HTPTFFSGDQ LQWWMLTELN YEHPVGAFIS
MYPAVIVVMC YIAVIWEIMF IVLAWRGVPR MIFLTLGVIF HAATFFTLGL LSFPPVCFAC
YLAFMNDNDA RWLASHGRWI MRKFHLRNWI APLNATAAIK AFSIQSPQVQ TSPATGYARV
VRQTGLWGAC CACLALMGVA TEYQVDRYGV RRPEGPMVLE PMDQAVAKKF LSPAPKFREV
DKFFAIDVGT LLVADQLAIR KQYYQIGETM IVQCQLLPPH EDMYLECLIL NEEGQIEGVQ
ELVATREMNR ANFNWPLCEN VQSGRHQIVI RSAGQEIARR TFFVNGETCD VKK
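
The protein sequence corresponds to a translation of the gene sequence under translation table 11. The sketch below shows one way to reproduce that translation in plain Python. It relies on the fact that table 11 assigns the same amino acids to codons as the standard genetic code (the tables differ in which start codons are permitted); because this gene begins with ATG, no special start-codon handling is included. The function name `translate` and its `to_stop` parameter are illustrative and not part of any library.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str, to_stop: bool = True) -> str:
    """Translate a DNA sequence codon by codon.

    Whitespace is ignored, unknown codons become 'X', and a trailing
    partial codon is dropped. With to_stop=True, translation ends at
    the first stop codon, which is not included in the result.
    """
    dna = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First three blocks and final 12 bases of the gene sequence.
    print(translate("ATGTCCCAGC TTTCCCATGA CAGTGCGATC"))
    print(translate("GTGAAAAAGT AA"))
```

Translating the first thirty bases gives MSQLSHDSAI, the first block of the protein sequence, and the final twelve bases give VKK followed by a stop codon, matching the end of the protein.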