Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cphamn1_1725 |
Symbol | |
ID | 6375412 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides BS1 |
Kingdom | Bacteria |
Replicon accession | NC_010831 |
Strand | + |
Start bp | 1869447 |
End bp | 1870742 |
Gene Length | 1296 bp |
Protein Length | 431 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 642684218 |
Product | proteinase inhibitor I4 serpin |
Protein accession | YP_001960124 |
Protein GI | 189500654 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG4826] Serine protease inhibitor |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGTATTA TTATTTATAA CCGCTTAATC CCTGCCGCCA TGTATACAAG AAAAGCCTTG CACCTGATTC CTCCGCTCTT CCTGCTCATG GCGCTCTTTT CTTTCAGCCA GCCGGCAGCG CCCGGAAAAA CAGAGAAACC GGAACCGATG CAATCAACCG AAGCTGAAAA AAGAGAGAAT ACAGCATTTG CGCTGGATCT CTATCACACC ATCGGAAAAA GCACGAAAGG CAACATCTTC TTTTCGCCCT ATAGTATCTC ATCGGCACTC TCAATGACCC TTTCCGGCGC AGCCGGGACT ACCGCACGTC AGATGGGCGA TATGCTCTAC GCTCCTGAAG ATCTACAGCG ATACCATCAC ACCCGGGCAT CGAGAGAAGC AGAGATCAGC AGCATCGCAA AAAAAGGAAG CGTCACGCTC GAAACCGCCA ACGGACTCTT TCCCCAGAAA GGCTATGAGC TGTCAAAAAC ATTTGTTCAG GAACTGCTCA CCGTCTACCG TTCGACGCTG ACTCCGGTTG ACTACCGGGC GGACACTGAA AAAGCGAGAA AAACCATCAA CAGATGGGCG GAACAAAGAA CCCGAAACAC CATCCGTGAG CTCATACAGC CGGGAATTCT CAACACCCTT TCAAGACTTA CCCTTGTCAA CGCGATATAT TTCAAAGGTA ACTGGAAAAA AGCTTTTGCC GAGTCAGAGA CCACTGCCGC CGATTTTTAT ACCGACAAGA GTTCAACGTC AACAGTCTCC ATGATGCATC AGGAAAACAC CTTCCCTTAC ACGCTCTCAG ATTCCCTTCA GATTCTTGAA CTCCCCTACT CCGGAGAAGA TATTTCAATG CTCGTCCTGC TGCCTGAAAA AAACAAAGGA CTTGCCGGAC TCGAAGCCGA TCTTACAGCT GAAAAGCTCT CACTCTGGAC AGATTCCCTC AAACCGCAGA AGGTAAGGGT ATTCCTCCCG AAATTCACCA TGTCATCAAC CCTGCGTCTT GATGATTCCC TGAAAAAACT TGGAATGACT GACGCGTTTG ATCCCGGGAG AGCAGATTTC TCACCAATGA CCGTCAATAA GGATAAACTT TTCATCGGAG CAGTCGTTCA CAAGGCTTTT GTCGATATCA ACGAAACGGG AACCGAAGCC TCTGCGGCAA CCGGCGTCGT AGTCGGCCTG ACATCCGCTG TTCAGGCACC GACGCCCGTT TTCAGGGCCG ATCATCCATT TCTTGTTCTT ATCAGGTCGA ACCGTTCCGG CTCAATTCTG TTCATGGGAA GAGTTTCCGA ACCGGATAAT GACTGA
|
Protein sequence | MRIIIYNRLI PAAMYTRKAL HLIPPLFLLM ALFSFSQPAA PGKTEKPEPM QSTEAEKREN TAFALDLYHT IGKSTKGNIF FSPYSISSAL SMTLSGAAGT TARQMGDMLY APEDLQRYHH TRASREAEIS SIAKKGSVTL ETANGLFPQK GYELSKTFVQ ELLTVYRSTL TPVDYRADTE KARKTINRWA EQRTRNTIRE LIQPGILNTL SRLTLVNAIY FKGNWKKAFA ESETTAADFY TDKSSTSTVS MMHQENTFPY TLSDSLQILE LPYSGEDISM LVLLPEKNKG LAGLEADLTA EKLSLWTDSL KPQKVRVFLP KFTMSSTLRL DDSLKKLGMT DAFDPGRADF SPMTVNKDKL FIGAVVHKAF VDINETGTEA SAATGVVVGL TSAVQAPTPV FRADHPFLVL IRSNRSGSIL FMGRVSEPDN D
|
| |