Gene Plim_4251 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPlim_4251 
Symbol 
ID9122181 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePlanctomyces limnophilus DSM 3776 
KingdomBacteria 
Replicon accessionNC_014149 
Strand
Start bp5179 
End bp6813 
Gene Length1635 bp 
Protein Length544 aa 
Translation table11 
GC content57% 
IMG OID 
Productphage portal protein, HK97 family 
Protein accessionYP_003632257 
Protein GI296051583 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones685 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAACCCTT TTAACTTCGC CGTCAACACG TCGCAGGTGA CCAACACGGA AGTCACTGGC 
AATCCCGAGA ATCCCGCAGT GCCCCTCGAT CCGGATGCGT GGGAGGATCT CGGATGGCTC
GGCCGCATGA CCGATGCCGG CGTACGGGTG AATCACGAAA ACTGCATCGG TCACCCTCCA
CTTTGGCGAG CGGTCAACCT GATTGCCGGA GACGTGGCCA AGCTCCCACT CAATGTCTAT
CGCCGAACTC AAGGCGGTCA CGAGCGGGAC AAAGATCACC ACGCCAACTG GCTGCTGAGG
CACAAGGTCA ACGACTACAT CCTCAGTTAT GTGTTCAAGA AAACCTTGAC GTTCCACTCG
CTGTTGCATG GGCAGGGATT CGCCTACATC CATCGCGAGG GCACAGTTCC CAAGCAACTC
GTACCGCTGA ATCCTGAGAC AACAGAGCCT GTCGTCATCG GCGGGAACTT GTTCTATGGC
GTGACCATCA ACGGAAATAA GCGACGACTG CACGCGACGG ATGTGCTGCA CATCAAAGGG
ATCTCGTTCG ATGGCATCGA AGGCATTGAT GTCATTCAGG TCATGGCTCA GGCACTGGCT
TTGGGAATCG CAGCCCAGAA GTACGGGGCC AAGTTTTTCG GCTCCGGTGC TAAGGCGGGC
GGAATCCTGA TGCTGCCTCC GGGCCTGAAG CCGTCTGCCC GAAAGCAACG TGAGGAGGAG
TGGAAGAAAG AGCAGGAAGG CCTCGACAAT GCCTTCAAGT CCATGCTGCT GGAAGATGGT
GCCAAGTGGA TCCAGAACAC GATCACACCT GAGCAGGGTC AGTTCCTGCA GACCCGACAG
TTCGAGGCCA TTCAAGTCGC CAATGTTCTT GGGGTGCCAC CACACAAGAT CGGCGACAAT
GGCAAGGTCT CTTACAACTC GCTTGAGCAG GAGAACAAGA GTTACCTGCA GGACTGCCTC
GATCACTGGC TGGTCATGTG GGAGCAGGAA TGTTGGGACA AACTTCTCAC CGAGAACGAG
AAGCGAAACG ACACGCACTT TGTCGAGTTC AATCGGTCGG CATTGCAGCG AACGGATGAG
AAGACAGAAG TCGAATCACT GGAGAGACAG GTCAACTGCG GGATGCTGCT GCTCAATGAA
GCGCGGGCGA TCCGCAACCT GCCACCAGTG GAAGGCGGCG ATCGGCCACG CATGCCCTCG
AACATCATCT TCACCGATGC GAAGCAGCTC GAAGCGAAAC CTGCGGCTGC CACCAATGCT
GGGGAGAGAA TCGCTCAGTC GATCCAGAAC GCACTCAATG ATCGGATCGA TCGACTGCTC
GAAGTGGAGC AGGCGGTTCG ATCAAAGGCG AACTGCTCGA TCGATCTGCT GACTCAGCAT
GGTCGCAAGC TGGCCAGTGC TGTGATTCCT GTGCTGGATA TTTATGCCGC CAGCCAGGGC
AAGATCCTTG ACGCTGACAA GACCACGGCT GCCATCGTGG CCGATTGTGT CAGCTATCAG
GCTGCAGAGA ACTGGCACAC GTCTCGACGC AATCATCTCG CCACCTTTTT GAAAGGCCTC
GTGAATGGAT CCCATCCTGA AGAAACTGGA AGAGCTGCGG GCGAAGCAGA ACAAGCTGGC
GAAACCCAGC CGTAA
 
Protein sequence
MNPFNFAVNT SQVTNTEVTG NPENPAVPLD PDAWEDLGWL GRMTDAGVRV NHENCIGHPP 
LWRAVNLIAG DVAKLPLNVY RRTQGGHERD KDHHANWLLR HKVNDYILSY VFKKTLTFHS
LLHGQGFAYI HREGTVPKQL VPLNPETTEP VVIGGNLFYG VTINGNKRRL HATDVLHIKG
ISFDGIEGID VIQVMAQALA LGIAAQKYGA KFFGSGAKAG GILMLPPGLK PSARKQREEE
WKKEQEGLDN AFKSMLLEDG AKWIQNTITP EQGQFLQTRQ FEAIQVANVL GVPPHKIGDN
GKVSYNSLEQ ENKSYLQDCL DHWLVMWEQE CWDKLLTENE KRNDTHFVEF NRSALQRTDE
KTEVESLERQ VNCGMLLLNE ARAIRNLPPV EGGDRPRMPS NIIFTDAKQL EAKPAAATNA
GERIAQSIQN ALNDRIDRLL EVEQAVRSKA NCSIDLLTQH GRKLASAVIP VLDIYAASQG
KILDADKTTA AIVADCVSYQ AAENWHTSRR NHLATFLKGL VNGSHPEETG RAAGEAEQAG
ETQP