Gene Caci_3294 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_3294 
Symbol 
ID8334647 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp3631610 
End bp3634939 
Gene Length3330 bp 
Protein Length1109 aa 
Translation table11 
GC content69% 
IMG OID644956439 
Productcoagulation factor 5/8 type domain protein 
Protein accessionYP_003114042 
Protein GI256392478 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTGCCA CCGGAAGCCG CACCCCCGCC CCACCTCTGC GCAGACGGCT CACGGCGCTG 
GCCACGGCCG CGGCGAGCAC GGTCGCCGGC GCGGCCCTGC TGGTCGGCGC CTCGCCGGCG
CACGCCATGA ACGGACCGGG CACCCCGCCG TACTGGGCCC AGTCGCCGTT CAGCGTGCCG
AGCGGGACCG GCGCGAGTCT GCCGTTCACC GAGTACGAGG CGGAGGCTTC GACCACCACC
GGGACGCGCG TCGGGCCGGA CTTCACGCAG GGCTCGCTCG CCAGCGAGGC CTCCGGCCGT
GAGGCGGTGC AGCTCACCGG TTCGGGCCAG TACGTGCAGT TCACGCTCAC CAGCGCCGCC
AACGCCTTCG ACCTGCGCTA CTCGCTCGCG CAGGGCGCCT CCGGCTCGCT GTCGGTGTAC
GTCAACGGCA CCAAGCAGAG CAAGGAGCTG TCGCTGACGT CGGCGTACAG CTACATCAGC
ACCGGCGGCA TCACCGGCAG CAAGACGCAC AAGTTCTTCG ACGACACCCG CATGATGTTC
GGCCAGACCC TGGCCGCCGG GACCACGGTG AAGGTGCAGG TCGACTCCTC CGACAGCGCC
GTGCCCTACA CCGTCGACGT CGCAGACTTC TACAACGTCC CGACCGCGGC GAGCCAGCCC
GCCGGTTCGG TCTCGGTCGT CACCGAGGGC GCGGACCCCA CTGGCGCCAA CGACTCCAGC
AACGCCTTCA ACACCGCGAT CAACGCCGCC AACGCCGCGA ACCAGTCGGT CTGGATCCCG
CCGGGCACCT ACCTGGTCAC CAACCCGATC CAGACCCAGA AGGCGACGAT CGTCGGCGCC
GGCAACTGGT ACTCGCAGAT CAAGACCAAC ATGTTCATCC GCAACTCCTC GGCGGTCTCC
GGGCCGGTGA ACCTCAGCGG CTTCGCGATC CTGGGCAGCA CCGTCGGACG CCATGACGAC
AGCTCGGCGA ACGGCATCGA CGGCTCGCTG GGCAACGGCT TCACCGTCAA CGGCTTGTGG
ATCCAGGACA CCAACGTCGG CTTCTGGCTG CAATACGGGA ACAGCAACGG CACCGTGGAG
AACACGGTCG TGGAGTCCAC CGACGCCGAC GGCCTGAACT TCAACGGCAA CGCCAGCGGC
AACGCCAGCG GCAACACCGT GAAGAACAAC TTCCTGCGCG GCACCGGCGA CGACGCCCTG
GCGATCTGGT CCTACCCGAC CGCCGACTCC AACATCACCT TCGCCAACAA CACGATCGTG
GCGCCGACGC TGGCCAACGG CATCGCGGAC TACGGCGGGG CGAACAACAC GATCTCCAAC
AACGTGATCG CCGACGACAA CGCCCTGGGC AGCGGTCTGA CGATCTCCAA CGAGGCGTTC
CTGCAGCCGT TCTCGCCGTT GTCCGGGACC ATCACGGTCT CCGGCAACTA CCTGATCCGC
GCCGGCGCGT ACAACCCGAA CTGGGCGCAC CCGATGGGCG CCGTGCAGTT CGACTCCTAC
GACTCCGATT TCAGCAACGT GACGGTCAAC TACAGCGGCG GGGCGATCCT GGACAGCCCG
TATGAGGCCT TTGAGATCGT CGGCGGAGAC GGGACCGGGC ACGTCGTCAA CGGGCTGAAC
ATCAGCAACG TGAAGGTGCA GAACACCGGT ACCACCGTCT TCCAGGCGGA GACCGGCGGC
GCGGCGAGTG TCAGCGGCCT GACAGCCAGC GGGCTCGGCG TCTCAGGCAC GTACAACAAC
AGCTACCCCG GCAACGTCGC GGGCGCCTAC ACCTTCAACC TCGGCAGCGG GAACTCCGGC
TGGAGTACCA CCCCGGTGCT CACCACGTTC CCGGATCCGG TGCAGCCCGG CGCTCTGCAC
GCCTCGCCGG CCGCGCTGTC GTTCGGCGAC GTGAAGTCCG GCACGACCAG CGCGCCGCAG
TCGGTGACGG TGACGAACCC GGGCACCAGC GCCGCGCCGA TCTCCTCGAT CAGCGCCACC
GGGCCCTTCT CGCAGACCAA CAACTGCGGC AGCTCCCTGG CCGCCGGCGC CTCCTGCACC
GCGCAGGTGA CGTTCGCCCC GACCACCGGC GGCAACGCCA CCGGGACGCT GACCGTCGCG
ACCAGCGCTC CGGGCGGTCC GCTGAGCGTC GCGCTGTCCG GACGCGGCAT CACCTCCACG
ACGAACCTGG CGCTGGGCCA GCCGGCCACG GCGAGCAGCA CGCAGGGCAC GTTCGTGGCG
GGCAACGCCA CCGACGGGAA CACCGGCAGC TACTGGGAGA GCGCCGACGG CGCGGGCTAC
CCGCAGACCA TCACGGTGGA CCTGGGCTCG ACGCAGCCGA TCGGCTCCAC GACGCTGAAC
CTGCCGCCGT CCTCGGCATG GGGCGCGCGT ACGCAGACGC TGTCCATTCT GGGCAGCACT
GACGGGACGA ACTTCACGCA GATCGTCGGC TCGGCCGCGC ACACCTTCGA CCCCGCGAGC
GGAAACACTG CGACCATCGC GCTGCCGTCC GGCACGAGCG CCCGTTACGT ACGCCTGAGC
TTCACCGGCA ACACCGGCTG GAGCGCGGCG CAGCTGTCGG AGTTCCAGAT CTTCCCCGGC
GGCGCGGCCA ACGGCAGTGC CCTGACAGCG AACCCGTCGA ACGTGTCGTT CGGCAGCGTG
GCAGTCGGCT CGACGAGCAG CGCGCAGACC GTGACGGTGT CCAACCCAGG CGGCACTGCG
GCAGCGATCT CCTCGATCAG CACGAGCGCA CCCTTCTCGC AGACCAACAC CTGCGGCACC
TCCCTAGCCG CCGGCGCCTC CTGCACAGTC AAGGTGACCT TCACCCCGAC CACCGCCAGC
AGCGCGAACG GCACCCTCTC AGTCGCCAGC AACGCCCCCG GCAGCCCACT GACCGTAGCG
CTCTCCGGCA CCGGCACCTC CACCGGCGGC AACACCAACC TCGCCCTCAA CAAGCCCACA
ACAGCCAGCG GCACCACCCA GAACTACACC CCCGGCAACA CCGTCGACGG CAACACCAGC
AGCTACTGGG AGAGCACCGA CAACGCCTTC CCGCAGTGGC TCCAGGTCGA CCTCGGCGCC
TCCGCCAGCG TCAGCCGCAT CGTGATGGAC CTCCCGCCCT CCTCCTCATG GGGCGCCCGC
ACCCAAACCA TCCAGATCCA GGGCAGCACC GACGGCACGA ACTTCACCAC CCTCGCCCCC
TCCCACGCCT ACACCTTCGA CCCCGCCACG GGGAACACGG CGACCGCCAC CTTCACCGCC
GCCACCGTGC GCTACGTGAG GCTGACGTTC ACCGCCAACA CCGGCTGGCC GGCCGGTCAG
CTCTCGGAAC TGCAAGTGTT TTCCCAGTAG
 
Protein sequence
MSATGSRTPA PPLRRRLTAL ATAAASTVAG AALLVGASPA HAMNGPGTPP YWAQSPFSVP 
SGTGASLPFT EYEAEASTTT GTRVGPDFTQ GSLASEASGR EAVQLTGSGQ YVQFTLTSAA
NAFDLRYSLA QGASGSLSVY VNGTKQSKEL SLTSAYSYIS TGGITGSKTH KFFDDTRMMF
GQTLAAGTTV KVQVDSSDSA VPYTVDVADF YNVPTAASQP AGSVSVVTEG ADPTGANDSS
NAFNTAINAA NAANQSVWIP PGTYLVTNPI QTQKATIVGA GNWYSQIKTN MFIRNSSAVS
GPVNLSGFAI LGSTVGRHDD SSANGIDGSL GNGFTVNGLW IQDTNVGFWL QYGNSNGTVE
NTVVESTDAD GLNFNGNASG NASGNTVKNN FLRGTGDDAL AIWSYPTADS NITFANNTIV
APTLANGIAD YGGANNTISN NVIADDNALG SGLTISNEAF LQPFSPLSGT ITVSGNYLIR
AGAYNPNWAH PMGAVQFDSY DSDFSNVTVN YSGGAILDSP YEAFEIVGGD GTGHVVNGLN
ISNVKVQNTG TTVFQAETGG AASVSGLTAS GLGVSGTYNN SYPGNVAGAY TFNLGSGNSG
WSTTPVLTTF PDPVQPGALH ASPAALSFGD VKSGTTSAPQ SVTVTNPGTS AAPISSISAT
GPFSQTNNCG SSLAAGASCT AQVTFAPTTG GNATGTLTVA TSAPGGPLSV ALSGRGITST
TNLALGQPAT ASSTQGTFVA GNATDGNTGS YWESADGAGY PQTITVDLGS TQPIGSTTLN
LPPSSAWGAR TQTLSILGST DGTNFTQIVG SAAHTFDPAS GNTATIALPS GTSARYVRLS
FTGNTGWSAA QLSEFQIFPG GAANGSALTA NPSNVSFGSV AVGSTSSAQT VTVSNPGGTA
AAISSISTSA PFSQTNTCGT SLAAGASCTV KVTFTPTTAS SANGTLSVAS NAPGSPLTVA
LSGTGTSTGG NTNLALNKPT TASGTTQNYT PGNTVDGNTS SYWESTDNAF PQWLQVDLGA
SASVSRIVMD LPPSSSWGAR TQTIQIQGST DGTNFTTLAP SHAYTFDPAT GNTATATFTA
ATVRYVRLTF TANTGWPAGQ LSELQVFSQ