Gene Caci_3687 details

Gene Information

Locus tag: Caci_3687
Symbol:
ID: 8335040
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Catenulispora acidiphila DSM 44928
Kingdom: Bacteria
Replicon accession: NC_013131
Strand:
Start bp: 4127166
End bp: 4129190
Gene Length: 2025 bp
Protein Length: 674 aa
Translation table: 11
GC content: 68%
IMG OID: 644956827
Product: coagulation factor 5/8 type domain protein
Protein accession: YP_003114430
Protein GI: 256392866
COG category:
COG ID:
TIGRFAM ID:
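
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record: the variable names are illustrative, and it assumes that the start and end coordinates are inclusive and that the coding sequence ends with a single stop codon that is not counted in the protein length.

```python
# Values copied from the Gene Information fields above.
start_bp = 4127166
end_bp = 4129190
gene_length_bp = 2025
protein_length_aa = 674

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted as a residue.
codons = gene_length_bp // 3
expected_protein_length = codons - 1

print(span_bp == gene_length_bp)                     # coordinates agree with gene length
print(gene_length_bp % 3 == 0)                       # length is a whole number of codons
print(expected_protein_length == protein_length_aa)  # codons minus stop equals protein length
```

Under these assumptions all three checks print True for this record.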


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0980417
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 33
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGGAA TCAAGCGGCG ACCATACGCC ATCCTGGTGT TGATGATCGC CCTCGTGGCG 
AGCGTGATGC TCGTCAGCAC CGCCCCGAAG GCCCACGCCG CCGGCACTTT GCTGTCGCAG
GGCAAGCCGG CCACGGCCTC CTCGGCCGAG AACGCCGGCA CGGCGGCGAC CGCCGCCGTC
GACGGCAACA CCGGAACCCG CTGGTCCAGC CAGTTCAGCG ACCCGCAGTG GCTGGAGGTG
GACCTCGGTC AGGCCTCGAC GATCAGCCAG GTCGTGGTCC AGTGGGAGAC CGCCTCGGCC
AAGGCGTTCC AGATCCAGAC GTCGAACGAC GGCACGAACT GGACGTCGAT CTACTCCACG
ACCACGGGGA CCGGCGGTAC CCAGACGCTG AACATATCCG GGTCCGGACG GTACGTCCGC
ATGTACGGCA CAGCCCGGAA CACCGCGTAC GGCTACAGCA TCTGGGAGTT CCAGGTCTAC
GGCAGCGCGG GTACCGGCAC CGGCGGCGGC AGCTGCGTCA ACAACGCGGC GCTGAACCAC
CCGGCCACCG CCTCCTCGGC CGAGAACGCC GGCACGTCGG CCGCGAACGC CGTCGACGGC
AACACCGGAA CCCGCTGGTC CAGCCTGTTC ACCGACCCGC AGTGGCTCCA GGTGGACCTC
GGTTCGACCC AGCAGATCTG CGGGATCCAG CTCCAGTGGG AGGCCGCGTC CGGCAAGGCG
TACCAGATCC AGACGTCGAA CGACGGCACG AACTGGACGT CGGTCTACTC CACGACCACG
GGTCCCGGCG GCACCGAGAA CCTGACGGTC AGCGGCTCGG GCCGTTACGT CCGCATGTAC
GGGACCGCGC GCAACACGGC GTACGGCTAC TCCCTGTGGG AGTTCCAGGT CCTGACCGCC
ACGGTCGTCG GCGGTGACAT GATCACCGTC ACCAACCCGG GCAACCAGAC CGCCGTGGTC
AACACGGCCG TCAGGCTGCA GATGGGCGCC ACCGACTCGG TCGCCGGCCA GACGCTGACC
TTCACCGCGA CCGGGCTGCC GGCCGGGCTG TCGATCAGCT CCTCGGGCCT GATCACCGGC
ACGCCGACCA CCGCGGCCAC CTCCAGCGTG ACCGTGACCG CCAAGGACAC CACCGGCGCC
ACCGGCTCGA CCACCTTCGG GTGGACGGTC AACGCCACCG GCGGCGGCAC CGGCGCCCCG
CCGACCGCGT TCTGGGGCAA CGTCTCGGCC ATCCCGCCGG CGGCGCACGT GATGGAGTTC
ATGATCCAGA ACCAGACCAA CGGCCAATAC CCCGACAGCC AGGTCTACTG GAGCTTCAAC
GGCCAGACCC AGTCCATCGC GCAGCAGCGC TACATCGACA TGCCGGCCAA CTCCGCCGGC
CGCATGTACT TCTACCTCGG CACGCCGAAC GGTCCCTACT ACGACTTCAT CGAGTTCACC
GTGGGCACCA CCTTCATCAA CGTCGACACC ACCCGCGTCG ACCGCTTCGG CCTGAAGCTC
GCCCTCCTGG TCCACGACCA CAGCGGAAAC GAGCAGGAGA TCGGCGAGAA CTACGCCACC
TTCCAGGAGA GCCGCACATC GACCTTCAAC CGCTTCCAGT CCTCGGTCCC CACCGAGTTC
AAGGAACTGG CCACGGACAA CGCCCCCTAC GGCATCCCCT CCCCAGGCAA CGACGCCGCG
TTCCAGTCAG GCGGCCCGTA CGCGAACTAC TTCCAGGCCT ACGCAGCAGC CAACGGCGAC
ACCGCCGACA GCACCCCGCA GATCTTCGGC TGCGGCGGCA CCCTGTCCGG CAACCCCCAA
CTGTGCGCCG GCCTGAACCG CCACGTAGCC CAGCTCCCAG CAGCCCAGCA GTCGATCCCA
GCCAACTTCT ACCAGGCCGG ACCGGCAAAC TACTACGCCC AGTTCTGGCA CCAGAACGCC
ATCAACGGCA TGCAGTACGG CTTCCCCTAC GACGACGACG CAGGCCAGAG CTCGGACATC
TCGGTGAACA ACCCGCAGTA CGCCGTCGTC GCGGTGGGCT GGTGA
 
Protein sequence
MTGIKRRPYA ILVLMIALVA SVMLVSTAPK AHAAGTLLSQ GKPATASSAE NAGTAATAAV 
DGNTGTRWSS QFSDPQWLEV DLGQASTISQ VVVQWETASA KAFQIQTSND GTNWTSIYST
TTGTGGTQTL NISGSGRYVR MYGTARNTAY GYSIWEFQVY GSAGTGTGGG SCVNNAALNH
PATASSAENA GTSAANAVDG NTGTRWSSLF TDPQWLQVDL GSTQQICGIQ LQWEAASGKA
YQIQTSNDGT NWTSVYSTTT GPGGTENLTV SGSGRYVRMY GTARNTAYGY SLWEFQVLTA
TVVGGDMITV TNPGNQTAVV NTAVRLQMGA TDSVAGQTLT FTATGLPAGL SISSSGLITG
TPTTAATSSV TVTAKDTTGA TGSTTFGWTV NATGGGTGAP PTAFWGNVSA IPPAAHVMEF
MIQNQTNGQY PDSQVYWSFN GQTQSIAQQR YIDMPANSAG RMYFYLGTPN GPYYDFIEFT
VGTTFINVDT TRVDRFGLKL ALLVHDHSGN EQEIGENYAT FQESRTSTFN RFQSSVPTEF
KELATDNAPY GIPSPGNDAA FQSGGPYANY FQAYAAANGD TADSTPQIFG CGGTLSGNPQ
LCAGLNRHVA QLPAAQQSIP ANFYQAGPAN YYAQFWHQNA INGMQYGFPY DDDAGQSSDI
SVNNPQYAVV AVGW
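
As a spot check that the protein sequence is the translation of the gene sequence, the first and last lines of the gene sequence can be translated by hand or with a few lines of code. The sketch below is an illustration, not the database's own method: `translate` and `CODON_TABLE` are hypothetical names, and the codon table is the standard amino-acid assignment, which translation table 11 shares for elongation codons (table 11 differs in which codons may act as start codons, which does not matter here because this gene starts with ATG).

```python
# Standard codon-to-amino-acid assignment, bases ordered T, C, A, G.
# Assumption: table 11 uses these same assignments for non-start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string; whitespace is ignored, '*' marks a stop."""
    dna = "".join(dna.split()).upper()
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna) - 2, 3))


# First and last lines of the gene sequence, copied from the record above.
# Each full line holds 60 bases, so every line starts in frame.
first_line = "ATGACCGGAA TCAAGCGGCG ACCATACGCC ATCCTGGTGT TGATGATCGC CCTCGTGGCG"
last_line = "TCGGTGAACA ACCCGCAGTA CGCCGTCGTC GCGGTGGGCT GGTGA"

print(translate(first_line))  # MTGIKRRPYAILVLMIALVA
print(translate(last_line))   # SVNNPQYAVVAVGW*
```

The first result matches the first 20 residues of the protein sequence, and the second matches its last 14 residues followed by the stop codon TGA.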