Gene Information |
Locus tag | Plim_4253 |
Symbol | |
ID | 9122183 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Planctomyces limnophilus DSM 3776 |
Kingdom | Bacteria |
Replicon accession | NC_014149 |
Strand | + |
Start bp | 7588 |
End bp | 8910 |
Gene Length | 1323 bp |
Protein Length | 440 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | |
Product | phage major capsid protein, HK97 family |
Protein accession | YP_003632259 |
Protein GI | 296051585 |
COG category | |
COG ID | |
TIGRFAM ID | |
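The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single untranslated stop codon; the function names are hypothetical and not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for a CDS, assuming one trailing stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values copied from the record: Start bp 7588, End bp 8910.
length = gene_length_bp(7588, 8910)
print(length)                     # 1323
print(protein_length_aa(length))  # 440
```

Under these assumptions the computed values agree with the Gene Length (1323 bp) and Protein Length (440 aa) fields.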
|
|
Plasmid Coverage information |
Num covering plasmid clones | 729 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCCACGA TGACGATGAA AGAACTTCAG GAAAAGCGGA ACAACCTGAT TGCTGAATCC CGAAAGATTC TCGACAAGGC CGATTCCGAA AAGCGAGTGA TGGATTCTGC TGAGAAGGCC CAGTGGGACA AGATGAATGC CGAGATAAGT GAACTCGGCG CTCACATGCG CCGCATGAAT CGGCAGAAGA AAAACGAAGC AAGAACGAAG GAGCTTTTGG AGAACAACGA TCTGGGTCCC AGCCAGGTCG GCAATCGTAA CAATCGCAAG CCAGGCAACG AGCCAGTCAA CTTCAATGAT CTGGTGTCTG GCTGGGTGAG GATGGCCACG GGCCGCCGTA TCACAAATGC CCAGAAGAAA GCTCTTCAAA ACCGCTCCGC AAAGATCAAT GGTCGCGAGG CCTATATTCC GCTTCACAAG AACTTCAACG AAGTAAAGCG GGCTGTCAAC GTTCTGACCA CTGGAACTGG TTCCAGTGGT GGCTACACGA TTCCCGAAGG GTTTATTGCC GCCCTTGAAG TTGCCATGCT CACTTTCGAT CCGGTCTCCG CGGTCGCAGA TGTCTGGCGT ACAAGTGCCG GTAATCCTAC TCCCTGGCCA ACTGGTTCTG ACACAGCGAA CTCAGGGGAG CAGTTGGGAG AATCGACGTC GTTTGGCGCA AGCGTAGACC CCACTATGGG CGTGGTGCTG TTCGGTGCCT ACAAGTTCAG CTCAAAGCCA ATCATGGTTC CTTACGAGCT GCTTGAAGAC ACTGGTGTCA ACCTGACTCA GCAACTCGGT ACCTGGTTGG GTGAACGACT CGGTCGGATC AAGGCTCTGC GAAACACGCT GGGCAACGGG ACGACTCAAC CCCAAGGGAT TGTCACCGGT TCAGTGCTGG GTCACACCGC CGCGGATGAT GTGACAATCA GCTTCGACGA CGTGATCAGG CTGGAGCACT CGGTGCCTCG CGCCTACCGA ATGAATGCCG GGTACATGTG CAACGATGCC GTGTCGCTGG CATTGCGTCT CTTGAAGGAT TCGCAAGGGC GTTACCTGTG GCAGGCCTCG GCCAATGCCG GGATGCCAGA CCTGCTCAAC AACCGGCCGC TGACGATCAA CGACAACATG ACCGGCACGA TTGCAGCCAG TGCCAAGACC ATGCTGTGCG GTGATTTCAG CAAGTTCAAG ATTCGCGAAG TCGGCACCAT CCGATTGAAG CGGCTGCAGG AACGCTATGC CGAACTGGAT CAAGAAGCCT TCATTGGCTT CGAACGCATG GACTCCAAAG TCCTTGATGC CGGTGCCGGT CCGATCCGTC ACCTGATTCA GGCTGCCAGC TAA
|
Protein sequence | MPTMTMKELQ EKRNNLIAES RKILDKADSE KRVMDSAEKA QWDKMNAEIS ELGAHMRRMN RQKKNEARTK ELLENNDLGP SQVGNRNNRK PGNEPVNFND LVSGWVRMAT GRRITNAQKK ALQNRSAKIN GREAYIPLHK NFNEVKRAVN VLTTGTGSSG GYTIPEGFIA ALEVAMLTFD PVSAVADVWR TSAGNPTPWP TGSDTANSGE QLGESTSFGA SVDPTMGVVL FGAYKFSSKP IMVPYELLED TGVNLTQQLG TWLGERLGRI KALRNTLGNG TTQPQGIVTG SVLGHTAADD VTISFDDVIR LEHSVPRAYR MNAGYMCNDA VSLALRLLKD SQGRYLWQAS ANAGMPDLLN NRPLTINDNM TGTIAASAKT MLCGDFSKFK IREVGTIRLK RLQERYAELD QEAFIGFERM DSKVLDAGAG PIRHLIQAAS
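As a rough consistency check between the two sequences, the gene sequence can be translated and its GC fraction computed. The sketch below is a minimal illustration, not the database's own pipeline: the helper names are hypothetical, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in permitted start codons). Only the first 30 bases of the gene sequence are used here.

```python
# Codon table built from the NCBI-style compact layout (base order T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the 10-base spacing used in the record and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons in frame 1; '*' marks a stop codon."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First 30 bases of the gene sequence, copied from the record.
first_30 = "ATGCCCACGA TGACGATGAA AGAACTTCAG"
print(translate(first_30))  # MPTMTMKELQ
```

The translated prefix matches the first ten residues of the protein sequence. Passing the full gene sequence to `gc_fraction` would give a value to compare against the GC content field.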
|
| |