Gene Cphamn1_1725 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphamn1_1725 
Symbol 
ID6375412 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides BS1 
KingdomBacteria 
Replicon accessionNC_010831 
Strand
Start bp1869447 
End bp1870742 
Gene Length1296 bp 
Protein Length431 aa 
Translation table11 
GC content50% 
IMG OID642684218 
Productproteinase inhibitor I4 serpin 
Protein accessionYP_001960124 
Protein GI189500654 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG4826] Serine protease inhibitor 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTATTA TTATTTATAA CCGCTTAATC CCTGCCGCCA TGTATACAAG AAAAGCCTTG 
CACCTGATTC CTCCGCTCTT CCTGCTCATG GCGCTCTTTT CTTTCAGCCA GCCGGCAGCG
CCCGGAAAAA CAGAGAAACC GGAACCGATG CAATCAACCG AAGCTGAAAA AAGAGAGAAT
ACAGCATTTG CGCTGGATCT CTATCACACC ATCGGAAAAA GCACGAAAGG CAACATCTTC
TTTTCGCCCT ATAGTATCTC ATCGGCACTC TCAATGACCC TTTCCGGCGC AGCCGGGACT
ACCGCACGTC AGATGGGCGA TATGCTCTAC GCTCCTGAAG ATCTACAGCG ATACCATCAC
ACCCGGGCAT CGAGAGAAGC AGAGATCAGC AGCATCGCAA AAAAAGGAAG CGTCACGCTC
GAAACCGCCA ACGGACTCTT TCCCCAGAAA GGCTATGAGC TGTCAAAAAC ATTTGTTCAG
GAACTGCTCA CCGTCTACCG TTCGACGCTG ACTCCGGTTG ACTACCGGGC GGACACTGAA
AAAGCGAGAA AAACCATCAA CAGATGGGCG GAACAAAGAA CCCGAAACAC CATCCGTGAG
CTCATACAGC CGGGAATTCT CAACACCCTT TCAAGACTTA CCCTTGTCAA CGCGATATAT
TTCAAAGGTA ACTGGAAAAA AGCTTTTGCC GAGTCAGAGA CCACTGCCGC CGATTTTTAT
ACCGACAAGA GTTCAACGTC AACAGTCTCC ATGATGCATC AGGAAAACAC CTTCCCTTAC
ACGCTCTCAG ATTCCCTTCA GATTCTTGAA CTCCCCTACT CCGGAGAAGA TATTTCAATG
CTCGTCCTGC TGCCTGAAAA AAACAAAGGA CTTGCCGGAC TCGAAGCCGA TCTTACAGCT
GAAAAGCTCT CACTCTGGAC AGATTCCCTC AAACCGCAGA AGGTAAGGGT ATTCCTCCCG
AAATTCACCA TGTCATCAAC CCTGCGTCTT GATGATTCCC TGAAAAAACT TGGAATGACT
GACGCGTTTG ATCCCGGGAG AGCAGATTTC TCACCAATGA CCGTCAATAA GGATAAACTT
TTCATCGGAG CAGTCGTTCA CAAGGCTTTT GTCGATATCA ACGAAACGGG AACCGAAGCC
TCTGCGGCAA CCGGCGTCGT AGTCGGCCTG ACATCCGCTG TTCAGGCACC GACGCCCGTT
TTCAGGGCCG ATCATCCATT TCTTGTTCTT ATCAGGTCGA ACCGTTCCGG CTCAATTCTG
TTCATGGGAA GAGTTTCCGA ACCGGATAAT GACTGA
 
Protein sequence
MRIIIYNRLI PAAMYTRKAL HLIPPLFLLM ALFSFSQPAA PGKTEKPEPM QSTEAEKREN 
TAFALDLYHT IGKSTKGNIF FSPYSISSAL SMTLSGAAGT TARQMGDMLY APEDLQRYHH
TRASREAEIS SIAKKGSVTL ETANGLFPQK GYELSKTFVQ ELLTVYRSTL TPVDYRADTE
KARKTINRWA EQRTRNTIRE LIQPGILNTL SRLTLVNAIY FKGNWKKAFA ESETTAADFY
TDKSSTSTVS MMHQENTFPY TLSDSLQILE LPYSGEDISM LVLLPEKNKG LAGLEADLTA
EKLSLWTDSL KPQKVRVFLP KFTMSSTLRL DDSLKKLGMT DAFDPGRADF SPMTVNKDKL
FIGAVVHKAF VDINETGTEA SAATGVVVGL TSAVQAPTPV FRADHPFLVL IRSNRSGSIL
FMGRVSEPDN D