Gene Caci_5044 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_5044 
Symbol 
ID8336398 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp5788836 
End bp5791145 
Gene Length2310 bp 
Protein Length769 aa 
Translation table11 
GC content70% 
IMG OID644958143 
Producthypothetical protein 
Protein accessionYP_003115745 
Protein GI256394181 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones40 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGATCCGAC GACATCAGCG CACGGCGACT CTGGTCGCCC TCGGCGCGGC CTTCTTCGCC 
AGCGCGATCG GCAGTGCGTC CGCCATGGCG GCGACGCCCC ATGCCGCCGC GCCGCAGGCC
GCCACGCAGA GCGTCAGCTA TCTGGGCCAC CAGTTCACCG TCCCGGCAAG CTGGCCGGTC
ATCGACCTGG CGAAGGCGCC GACCACCTGT GTCCGCTTCG ATGAGCACGC CGTCTACCTC
GGCCAGCCGG GTGCGCAGCA GGACTGCCCC AGCAAGGTCT TCGGACGGAC CGAGACCCTG
CTGATCCAGC CCGCCGCCGC CTCCACGGCA GCGGCCATGA CCACCGACAA CTCCGCCACC
CGCGAGCTCG ACACGACCGG CGACGGCTTC AAGGTCAGCG CCACCTACAA CACCGACCGC
GCGCTGGCCC AGTCCATCCT GACCAGCGCC GCGCTCCCGG CACCGTCCGC CACGGCGCAC
ATACCGACGC CGGGCACGGT GACCGCGCCG ACGTCCACCG CGCCGACGAG CAAGGCAGGT
CAGGCGAGCA CGTCCACGCA GTCCGCACAC TCACTGGCTA CCGCCGCCGT CGCGGCCAGC
AGCACCAACT TCACCGGCCA AGGCTTCGAC GCCTGCGCCG CGCCGAGCTC GTCGGCGATG
AGCGCGTGGA AGAGTTCCTC GCCCTACTCC GCCGTCGGCA TCTACATCGG CGGGGCGAAC
CGGGGCTGCG CGCAGCCGAA CCTCACCTCC ACCTGGGTCT CCGACGAGGC GGCGGCCGGC
TGGCGCTTCC TGCCGATCTA CGTCGGCCTG CAGGGCCCTG GCAACGGCTG CGGGTGCGCG
GCCATCAACT CCGCGAGCGA GGGCACCGCC GCCGCGGACG ACGCCATCAA CGACGCCGTC
TCCCTCGGCT TCCCGGCCGG CACCGAGATC ACCTACGACA TGGAGGCCTA CACCACCGGC
GGCTCCTACT CCTCGCTGGT GGTCGGCTTC GAAGCCGCCT GGTCCGCCGA GCTGCACGCC
CACGGCTACC TGTCCGGCGT CTACGGCAGC ATGGGGAGCA CGGTGTCGGA CCTGATCAAC
AACTACAGCT CCACCACCAT GCCGGACGTC CTGGACTTCG CCAGCATCCC CGGCAGCGGC
AGCAGCACCG TCTCCGACCC CGGCATCCCC AGCGCCGACT GGGCCAACCA CCAGCGCATC
CACCAGTACA CCCAGGGCCA CGACGAGACC TGGGGCGGCG TGGACATCCC CATCGACGCC
GACTACTTCG ACGTCCAGGT GTCCTCCAGC GCCCCACCGC CGAGCGCTCC GCACAGCAGC
GCCTCGGGAC TGGCCGTCGC CTCCAACGGC GGGTTCAACA CCGCTTGGAA GGGGACTGAC
GGCTACCAGT GGGTGGCCAA CGGCAGCGGC GCGGGCATCT CGGCCAAGGG CAACCCGTTC
CTGCTCGGCG TCGCGGCGAA CACGACTCCG TCGATGGCGA CGCTGTCCGA CGGTTCATGG
ATCTCGGCGT GGCAGGGCAG TGACGGCTAC CTGTGGCTGG CCACCGGCTC CGGAGCGAAC
ATCTCGGCCA AGGGCAACCC GTTCCTGCTC GGCGTCGCCG CCGGCACCAG CCCGTCGATC
GTCGCGCTGC CCAACGGCGG CTGGGAGATC GCGTGGAAGG GTCAGGACGG CTACCTGTGG
CTGGCCACCG GCTCCGGCAT CAACATCTCC GCCAAGGGCA ACCCGTTCCT GCTCGGCGTG
TCCGGCACCA CCAGCCCGTC TCTGGCGGCT CTGCCCAACG GCGGGTTCGA AGCGGCGTGG
AAGGGCGGGG ACGGCTACCT GTGGCTCGCT TCCGGCTCCG GTATCACCAT CACGGCTAAG
GGCAACCCGT TCCTGCTCGG CGTCGTCAAC AACCCGGCGC TGGTGACCAT GCCCGACGGC
AGCTTCGAGG CGGCTTGGAA GGGCGGCGAC GGGTACCTGT GGCTCGCCTC CGGCTCCGGC
GCCACGATCA CCGCCAAGGG CAACCCGTTC CTGCTCGGCG TCTCCGGCGA CACCAGCCCG
TCGATCGCGG CCCTGCCCAG CGGCGGCTTC GAGACGGCGT GGAAGGGTAA CGACGGCTAC
TTGTGGCTGG CCACCGGCAA CGGTGCGAAC ATCACGGCCA AGGGCAACCC GTTCCTGCTC
GGCGTGGCGA ACAACCCCGA GCTCGTGACC AAGTCTGACG GCAGCTTCGA AGCGGCGTGG
AAGGGCGGCG ACGGCTACCT GTGGCTCGCC TCCGGCTCCG GAATCAACAT CTCCGCCAAG
GGCAACCCGT TCCTGCTCGG CGTCGCGTAA
 
Protein sequence
MIRRHQRTAT LVALGAAFFA SAIGSASAMA ATPHAAAPQA ATQSVSYLGH QFTVPASWPV 
IDLAKAPTTC VRFDEHAVYL GQPGAQQDCP SKVFGRTETL LIQPAAASTA AAMTTDNSAT
RELDTTGDGF KVSATYNTDR ALAQSILTSA ALPAPSATAH IPTPGTVTAP TSTAPTSKAG
QASTSTQSAH SLATAAVAAS STNFTGQGFD ACAAPSSSAM SAWKSSSPYS AVGIYIGGAN
RGCAQPNLTS TWVSDEAAAG WRFLPIYVGL QGPGNGCGCA AINSASEGTA AADDAINDAV
SLGFPAGTEI TYDMEAYTTG GSYSSLVVGF EAAWSAELHA HGYLSGVYGS MGSTVSDLIN
NYSSTTMPDV LDFASIPGSG SSTVSDPGIP SADWANHQRI HQYTQGHDET WGGVDIPIDA
DYFDVQVSSS APPPSAPHSS ASGLAVASNG GFNTAWKGTD GYQWVANGSG AGISAKGNPF
LLGVAANTTP SMATLSDGSW ISAWQGSDGY LWLATGSGAN ISAKGNPFLL GVAAGTSPSI
VALPNGGWEI AWKGQDGYLW LATGSGINIS AKGNPFLLGV SGTTSPSLAA LPNGGFEAAW
KGGDGYLWLA SGSGITITAK GNPFLLGVVN NPALVTMPDG SFEAAWKGGD GYLWLASGSG
ATITAKGNPF LLGVSGDTSP SIAALPSGGF ETAWKGNDGY LWLATGNGAN ITAKGNPFLL
GVANNPELVT KSDGSFEAAW KGGDGYLWLA SGSGINISAK GNPFLLGVA