Gene Caci_4979 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_4979 
Symbol 
ID8336333 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp5692635 
End bp5695817 
Gene Length3183 bp 
Protein Length1060 aa 
Translation table11 
GC content68% 
IMG OID644958078 
Productcoagulation factor 5/8 type domain protein 
Protein accessionYP_003115680 
Protein GI256394116 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00350881 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.0761827 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCAACG GATTGACCCG GCGGCAGTTC ATGAACGGGA TCGGGGGTGC GGCGCTGGCA 
ACGACGGTGT TCTCCTCCGG CGCCGCTTTC GCAGCGGACG GCGCTGGCAC TGGCACCGAG
CGAGCGCCCG CGCCCGCGAC CCCGGCGCCC GACGACACGG TTGCCGCCGC GTACTACCAA
TACCTGCTGC GGCACACGAA TTGGGCGCAG CAGAAGTGGG ACGCGACTGC CGGCCACTAT
GCGGCGAGTG ACTACAACTT CGCCGTGGTG CTCGGCAACG CTGTCTTGCT GACGCACGGC
ACCTATGACG CATCCGTCGC GGGCGTGGAT GCCGCGACCC TGAAGACCCA GACCCTCGCC
ACGATCGACC ACTTCGCCGC CAGCAATGTG CTCAACGGCG GTACCGAGTG GGGCGAGACG
ATGTTCTTCG ACAGCACCTT CGAGCTCTAC TTCATCCTCG CCGCCAAGTT GTTGTGGAAC
GACCTCGACG CCGCGACGCA GACCCTGATC GACAAGATGA CCGCGGCGCA GGCGGCGTAC
ACCACGGCCC TGGGCAGCGG GAACGATCCG CGCAGCGGCA GCTGGTCGCC CAACGGGCTG
GCCGGTGGCT GGGAGGGCGA CACCAAAGTC GACGAGATGG CTGTGTACGC GCAGTGTCTC
GGACCGGCTG TGGCCTGGCT TCCACAGCAC CCTGACAATC CCACTTGGAG CACCTGGCTG
ACCACCTGGA TGCTCAACGA CACCGGGTTG CCGTCGGCCG ATCAGGCCAA CCCCACCGTC
GTGGACGGTC GGCCGATCTC GGACTGGAAC ACCGCGCACA ACATCTTCGA CACCTTCTTC
GTGGAGAACC ACGGCTCCTT CGAGCCGCAC TACCAACTCG AGACCTGGCG CATGTCCGCG
CGAGTGGCCG CGCATTTCCT GGCCGCGGGC CGGCAGATCC CGGCCGCTGC CGGGGCGGCC
AAGCCGAACG CCGCGCAGCT GTGGCGCACC ATCCGGCATG TGCAAAGCGA CAGCGGCGAG
CCGTTCATGC CGATGAATCC CGACCGATAC CACCTCTTCG GTCGCGACGT GCTGCCGCTG
GCGTTCCTGG CGCAGGTCAT GGGCGATCCG CTGGCGGCGC GTGCCGAAGC GAACATGGCC
GCGCAGCTCG GGCCGTACCA GCTTTATCCG CCGGAGTATC AGCTCACGAA GTTCAGCGGG
GAGGCGAAGT ACGAGCCGGA GGCGCGGGCC GAGCTGGCGA TCAGTTACCT GTTTCATGTG
TGGCGTGCGC AGCAGGGGGC GCCGGTTCGG CCGGTCACGG GGGAGCAGTT CGTCGCTTCG
GCGTCTGGCG CTACCGACTA TGGCGCCGTC CCCGGGTTGC TCGTGCACAA TACGGCGAAC
GCGTTCGCTG CGACCGTTTC CAAGCCGGGG TATGTGAAGT TCTGCTACGC GCCCAACCAT
GACGACTGGC TGTTCGACCT CTCCGGCGCG GCGCCCTCGC TGCTGCCTGC GACCGGGGCG
ACGGTCACGA ACCGCTTCGC TGCCGCTTAC AGCACCTTGC GTGACGGGTA TGACGCGAGC
GCTTCGCTGT TGACGTTGGC GAGTGGGTAC GCGGGATATG CGACGCTTCC CGATGGCGGC
GTGGTGTACG CGACCAGCGG GAACGGTGCC GGGGAAGGTG TGCTGAATGT GTTCAACCTG
GCGATGCCTG GCATCGCGGG GCTGGACGGG AGCCGGACTT ACGCCGGTGC CGGCGGCGCT
TTCACCGTCT CGGCTGGCGA TGTGCAGACC GGCGGCACTA ACGATCTGTC CTTCAGCGCG
GTGACGGCGC GGTACGTCCG GATGCTCGGT ATCAGGCCTG CTACACAGTA TGGCTACTCG
ATCTACGAGC TTCAGGTCTA CGCGCCTGGC GGCACGACCA ACCTGGCGCA GGGTAAGGCC
ACGACTGCTT CGTCCTTCAC GCCTGCCTAT CCGCCGCCCG CTGCCACCGA TGGCAATCCG
GCCACGCGGT GGGCGGTCGC CGTCGCCAGC CGATCGGTGC CGGACAGCTG GCTGCAGGTT
GACCTGGGAG CCGCCACGCA GATCGACCGC GTGTCCATCG CCTGGGAAGC TGCTTACGGC
GCGGCCTTCG CTATCCAGAC CTCGGACGAC GGCAGCACCT GGAGCACCGC GGCGGCCTTG
CCTGTGCAGC ATCACGTAGA CGGCGGTTGG GTGAACGTCG ACGGCCGTGC GGGATTCGTC
GTGCGCGGCA GCCGCAATCC GCTGACCGTG ATCGGGAACA CGCTCACCCT GTCCGACGGG
CCGGCGACCG GCGCCGCCGG GATGGTCGTG GAGGGCTATC CCGCGCAGAC GCCGCAGGGC
ACCGCAGCCA TGGCGGCCTT GCCGACGCCG ACGCCGACGC CGACGCCGAA CAGCACTGTC
GCAGGGCTCG CGGCGAGTAT CGCCGGGTCG CACCTGAGCC TGTTCAACCT CAGCGGACAG
AACATCGGTA CCGAATCAGC GCCGGCCGAG CTGAGCGTTC CAGCTTCGGG GCGTGCGCGG
CTGCTGTACC GCGGCCTACA GACCACCACC GCCACCGGGA CGGTCTACGA CGTCGTACTC
GCCGCCGCGA CCGCGCGCGT CGAACCCCCG CGCTTCACGC TGACCGGGAC CATTCCCGCA
GGCGTCGTAG CAGAGGTAGC GGACTCCCTG CACCTCACGC TCACAGCGCC CCCGACGAGC
GGCTGCACCC TGACCGTCGT CTCCCGCTCC AACGCCACAA CCCGAACCAT CTCCCTGCGA
GCCGGCCAGC AGAAACAGCT GACCTTCCCC GGAACCCTCA CCCCAACCCC CGACCTGGCC
CTGGGCCGCA CCACCTACCC CACCTCACCC CTACCGGCCG GCATGTCCGA CCCCGCCTCC
GCCGTAGACG GCAACCCCCA CACCGCCTGG ACCCCCGGCA CCCCCACCGC CCGCATGGTC
ATCGACCTCG GCACCCCCAC GCCCCTGGCA ACCCTGACCA CCCACTGGAC CACCCCCCAC
ATCCCCCCAT TCGCCATATC AGCATCCGCC GACGGCCAGC ACTACACCCC CCTCACCACC
AGCAACCCCC ACCACCCCGA CCAACCCCTC CCCATCAACA CAACCACCCG CTACCTAGCC
CTCACCCTCC AGAACTGGTC GCCCGCCCAC GCCAGCCTGA CCGCACTTTC GCTCACGTCC
TGA
 
Protein sequence
MRNGLTRRQF MNGIGGAALA TTVFSSGAAF AADGAGTGTE RAPAPATPAP DDTVAAAYYQ 
YLLRHTNWAQ QKWDATAGHY AASDYNFAVV LGNAVLLTHG TYDASVAGVD AATLKTQTLA
TIDHFAASNV LNGGTEWGET MFFDSTFELY FILAAKLLWN DLDAATQTLI DKMTAAQAAY
TTALGSGNDP RSGSWSPNGL AGGWEGDTKV DEMAVYAQCL GPAVAWLPQH PDNPTWSTWL
TTWMLNDTGL PSADQANPTV VDGRPISDWN TAHNIFDTFF VENHGSFEPH YQLETWRMSA
RVAAHFLAAG RQIPAAAGAA KPNAAQLWRT IRHVQSDSGE PFMPMNPDRY HLFGRDVLPL
AFLAQVMGDP LAARAEANMA AQLGPYQLYP PEYQLTKFSG EAKYEPEARA ELAISYLFHV
WRAQQGAPVR PVTGEQFVAS ASGATDYGAV PGLLVHNTAN AFAATVSKPG YVKFCYAPNH
DDWLFDLSGA APSLLPATGA TVTNRFAAAY STLRDGYDAS ASLLTLASGY AGYATLPDGG
VVYATSGNGA GEGVLNVFNL AMPGIAGLDG SRTYAGAGGA FTVSAGDVQT GGTNDLSFSA
VTARYVRMLG IRPATQYGYS IYELQVYAPG GTTNLAQGKA TTASSFTPAY PPPAATDGNP
ATRWAVAVAS RSVPDSWLQV DLGAATQIDR VSIAWEAAYG AAFAIQTSDD GSTWSTAAAL
PVQHHVDGGW VNVDGRAGFV VRGSRNPLTV IGNTLTLSDG PATGAAGMVV EGYPAQTPQG
TAAMAALPTP TPTPTPNSTV AGLAASIAGS HLSLFNLSGQ NIGTESAPAE LSVPASGRAR
LLYRGLQTTT ATGTVYDVVL AAATARVEPP RFTLTGTIPA GVVAEVADSL HLTLTAPPTS
GCTLTVVSRS NATTRTISLR AGQQKQLTFP GTLTPTPDLA LGRTTYPTSP LPAGMSDPAS
AVDGNPHTAW TPGTPTARMV IDLGTPTPLA TLTTHWTTPH IPPFAISASA DGQHYTPLTT
SNPHHPDQPL PINTTTRYLA LTLQNWSPAH ASLTALSLTS