Gene Caci_0104 details

Gene Information

Locus tag: Caci_0104
Symbol:
ID: 8331429
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Catenulispora acidiphila DSM 44928
Kingdom: Bacteria
Replicon accession: NC_013131
Strand:
Start bp: 107482
End bp: 108756
Gene Length: 1275 bp
Protein Length: 424 aa
Translation table: 11
GC content: 68%
IMG OID: 644953271
Product: proteinase inhibitor I4 serpin
Protein accession: YP_003110900
Protein GI: 256389336
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG4826] Serine protease inhibitor
TIGRFAM ID:
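
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 107482
END_BP = 108756
GENE_LENGTH_BP = 1275
PROTEIN_LENGTH_AA = 424


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one, assuming a single terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 1275
print(expected_protein_length(GENE_LENGTH_BP))  # 424
```

Under these assumptions both derived values agree with the Gene Length and Protein Length fields listed for Caci_0104.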


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00414663
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value: 0.0601662
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCACGCA AGCAGATAGC CGCGATCACC GCCGCATCCC TCGCCGCCCT GACGGCGTGC 
TCGTCCTCCG GAAGCTCTTC GTCGGCGGGC GGCTCCCTGG TGCGCGTCAA CGCCGCGCGG
GCGACCGTCG GGCAGGCCGA TCTGAGCGCC GCCTCGGCCG GCGTGAACGC GTTCGGCGTC
GACGTCTTCC ACTCCGTCGC CGACGGCGAC GAGGGCAACG TCATGATCTC GCCGACCAGC
CTGGCCACGG TGCTGACGAT GCTGCTGCCC GGCGCGAAGG GGCAGACCGA GGCCCAGATG
GCCAAGGCCC TGCACACCAC CATGACCGCC GACCAGTTCG CCGACGCGCT CGGCGCCCTG
GACTCGGCGA CCGTGCAGCG CGAGCTCGCC GACAAGGCAG AGCTGCAGCA GTACGACACC
GTGTGGACGC AGAAGGGATA CGACATTCAG CCGTCCTATC TCCAGACCCT CGCCTCGGCC
TTCGACGCCG GCGTCCGCGA GACCGACTTC ACGAACTCCG AGAGCGCGCG CCAGACGATC
AACAAGACCG TCGAGGATCA GACCAACGGA CTCATCAAGG ACCTCTTCGG TCCGGGCTCC
ATCAGTCCCG CGACCCGGCT CGCCCTCACC GATTCCCTAT ATCTCAAGGC GAAGTGGGCC
GACGCGTTCG CCAAGAGCGC GACCTCGGAC AAGCCGTTCC ACCTGACGAA CGGCAGCACC
GCGAACGTCC CGACGATGGG CAAGGAGCAC AGCTTCGCCT ACGCCCACGG CGCCGACTGG
CAGTACGCGG AGCTGCCGTA CCAGCAAGAT CACCTGGCGA TGGGCATCCT TCTCCCGGCA
TCCGGGTCCT TCGACACCTT CCGCAAGTCG CTCACCGGCG ACAGCCTCGC CACGATGACC
GCCTCGGCCA CGCCGTCCCC GGTCGACCTC GAGCTGCCGA AGTTCACGTT CGACACCAGC
CGCGATCTGA AGTCTCCGCT GGAGTCCCTG GGCATGCAGA CTGTCTTCGA CCCGAACTCG
GCGGACCTGA GCGGGGTCCC GGCCAAACCC GAGTCGCTGT TCGTCGGAGC CGTCGTGCAG
AAGACCCACG TCGCCGTCGA CGAGGACGGC ACCACAGCGG CCGCGGCCTC GGGCGTCACC
GTCGTCGCCG GCGCCGCGCC GCAGCAGTCC CCGCCGGCGG AAATGCACGT CGACCGGCCT
TTCCTCTTCT TGATCAGGGA CACCGTGACG GGCCAGATCC TGTTCCTGGG CCAGGTGAGC
GACCCACGCG GCTGA
 
Protein sequence
MPRKQIAAIT AASLAALTAC SSSGSSSSAG GSLVRVNAAR ATVGQADLSA ASAGVNAFGV 
DVFHSVADGD EGNVMISPTS LATVLTMLLP GAKGQTEAQM AKALHTTMTA DQFADALGAL
DSATVQRELA DKAELQQYDT VWTQKGYDIQ PSYLQTLASA FDAGVRETDF TNSESARQTI
NKTVEDQTNG LIKDLFGPGS ISPATRLALT DSLYLKAKWA DAFAKSATSD KPFHLTNGST
ANVPTMGKEH SFAYAHGADW QYAELPYQQD HLAMGILLPA SGSFDTFRKS LTGDSLATMT
ASATPSPVDL ELPKFTFDTS RDLKSPLESL GMQTVFDPNS ADLSGVPAKP ESLFVGAVVQ
KTHVAVDEDG TTAAAASGVT VVAGAAPQQS PPAEMHVDRP FLFLIRDTVT GQILFLGQVS
DPRG
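
The gene and protein sequences above are printed in blocks of ten characters, so the whitespace has to be removed before any analysis. The following is a minimal sketch, not the database's own tooling: it strips the spacing, computes GC content, and translates a coding sequence with the amino acid assignments of translation table 11, which are the same as those of the standard genetic code. The helper names (`clean`, `gc_content`, `translate`) are hypothetical, and alternative start codons permitted by table 11 are not handled; this gene begins with ATG, so that does not matter here.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nucleotides of the gene sequence listed above.
print(translate("ATGCCACGCA AGCAGATAGC CGCGATCACC"))  # MPRKQIAAIT
```

Passing the full gene sequence to `translate` and `gc_content` would allow a comparison against the listed protein sequence and the reported GC content of 68%.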