Gene Clim_1363 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagClim_1363 
Symbol 
ID6353773 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium limicola DSM 245 
KingdomBacteria 
Replicon accessionNC_010803 
Strand
Start bp1463530 
End bp1464804 
Gene Length1275 bp 
Protein Length424 aa 
Translation table11 
GC content55% 
IMG OID642668972 
Productproteinase inhibitor I4 serpin 
Protein accessionYP_001943402 
Protein GI189346873 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG4826] Serine protease inhibitor 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value0.525887 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCCACAA CGGATACTCA TGAAAAACAT AACTTTCCCA TGAACAACAC GATTGCATTG 
CTCTCCCTGC TCGGCGTCAT GCTTACCGCA CCGCTCTCCG GCGCATCCGG AGCGACCGGT
CTGCCTGAAA ACCATGCCGC ATCTCCGGAA GCACAAAACG AACTGGCCGT CGATCTGTAC
CGCAATCTTG CAGTTACCGG AAAAAACCTC TTTTTCTCCC CCTCCAGCAT CGAAACCGCG
CTTTCCATGA CCATGTCCGG AGCGCGAAAC CGGACGGAAC GGCAGATGGC CGATGTAATG
CATGTCGGCC CTGACGCCAT GGAACGCCAC CATGCCGGAC TCGCATCGTT CGAAAAACAG
CTTGAGTCCA TTCAGAAAAA AGGGAAGGTA ACGATAGCCT CCTCGAACTC GATCTGGCCG
CAGAAGAACT ATCCGCTTGC GCCTTCATGG CTTGCGCAGC TCAAACGGTA CTACGGAACA
TCGGTAACGC CGGTCGATTA CATCCATGAG ACGGAAAAAG CGCGGATCGC TATCAACCGG
CGAGTGGAAA AGGATACGAA AAACCGGATC CGGGAGCTTC TCAAACCCGG TATTCTCGAT
CCCCTGACAA GACTCGCGCT GGTCAATGCA GTCTATTTCA AAGGCGATTG GGAGCACCCG
TTCAATGAAA ACAACACGGT TGCATCCCCG TTTTACATCC GCCAGGGAAC GACAGGCAAA
GCCCCGCTGA TGCGGCAGAG TGCATCGTTC GGTTACGGCG ATCATGACGG GGTGCAGGTG
CTCGAACTTC CCTATGCCGG AAAAAAGCTC TCCATGATCG TGGTACTGCC GAAAGAACGG
TTCGGCCTCG AAGCTCTTGA AAAAACCCTG ACTCCGAAGC AGTTTGCCCT CTGGACGGCT
AATCTCAGCG AGAGAAAAAT CGAAGCGCTT CTTCCCAAAT TCCGCACCAC CTCAGCGTTC
CGCCTCGACG AGACTCTCAG GCATATGGGA ATGACCGATG CATTCGACAG GAATCTCGCC
GATTTCAGCG GCATGGTATC CAATAGCGAC AAACTGTACA TCGGTGCGGT CGTCCACAAG
GCTTTCGTGG ATGTCGGCGA AAAAGGCACC GAAGCTGCGG CAGCGACAGC CGTAGTCATG
CAGCTTCGGA GCGCAATGCC GATGCCGGTA CCGGTATTCA AGGCCGACCA CCCATTCCTC
TTTGCCATAC GGGAGAACAG CACGGGCCGC ATCCTTTTCA TGGGACGCAT TTCCGACCCT
GCAGATAACG GATAG
 
Protein sequence
MPTTDTHEKH NFPMNNTIAL LSLLGVMLTA PLSGASGATG LPENHAASPE AQNELAVDLY 
RNLAVTGKNL FFSPSSIETA LSMTMSGARN RTERQMADVM HVGPDAMERH HAGLASFEKQ
LESIQKKGKV TIASSNSIWP QKNYPLAPSW LAQLKRYYGT SVTPVDYIHE TEKARIAINR
RVEKDTKNRI RELLKPGILD PLTRLALVNA VYFKGDWEHP FNENNTVASP FYIRQGTTGK
APLMRQSASF GYGDHDGVQV LELPYAGKKL SMIVVLPKER FGLEALEKTL TPKQFALWTA
NLSERKIEAL LPKFRTTSAF RLDETLRHMG MTDAFDRNLA DFSGMVSNSD KLYIGAVVHK
AFVDVGEKGT EAAAATAVVM QLRSAMPMPV PVFKADHPFL FAIRENSTGR ILFMGRISDP
ADNG