Gene Caci_5033 details


Gene Information

Locus tag: Caci_5033
Symbol:
ID: 8336387
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Catenulispora acidiphila DSM 44928
Kingdom: Bacteria
Replicon accession: NC_013131
Strand:
Start bp: 5767639
End bp: 5770644
Gene Length: 3006 bp
Protein Length: 1001 aa
Translation table: 11
GC content: 68%
IMG OID: 644958132
Product: coagulation factor 5/8 type domain protein
Protein accession: YP_003115734
Protein GI: 256394170
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1874] Beta-galactosidase
TIGRFAM ID:
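
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length; the function names are hypothetical and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by an unspliced CDS.

    Assumes the final codon is a stop codon and is not counted.
    """
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


if __name__ == "__main__":
    length = gene_length_bp(5767639, 5770644)
    print(length, protein_length_aa(length))  # prints: 3006 1001
```

Under these assumptions the computed values agree with the listed Gene Length (3006 bp) and Protein Length (1001 aa).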


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 36
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGGATCA GTAGACGTAC AACCGGCGCC CTGCTGGCAG GCGCCCTCGC CCTGGCGGGC 
CTGTCCACCT CGGCCGCCCT GACCGCGGCG CCCGCCCACG CCGCGTCCGC GCCGACCACC
CCGATCTGGT CCACCCAGCT CGACTTCGAC AACGGCGGCG CCGCCTGGTC AGAGCCCTAC
TTCGCGGCGC TGGCGGCCAA AGGGCTGACC ACCGCCGAGC TGAACATGCC CTGGGGCACG
ATCGAGCCGT CGGCCGGGAC CTTCAGTTTC ACGATCTGGG ACCAGGAGTT GGCGAACGCC
GCCGCTGCCG GCATCCAGCT GATCCCGGTC TTCTGGCAGT CCGGGTGGGG CGGCAGCCCC
GCACCGTGGA TCACCGACTT GGAGAAGACC AGCACCGGGG CGGCAGGCGT GGCTCCGGAC
TGGTGGAACA CCACCGAGCA GGCGCAGTAC TTCACCTATG TCGAGAACAC CATCCAGAAC
TCCATCGCAC AGCCCGGCGG CTACGGCGGC GCGGTCCTGG ACTACGGATT CCTCGACGCG
CAGTGGGACA TCAGCGGCTC CGGCGGCGGC TATGCCAGCG GCGACATCAC CGAGTTCCAG
AACGTGTACC TGCCGAACGC CTTCGGCACC ATCGCCGCCT TCAACGCCGC CGAGGGCACG
TCCTACACAG CCTTCAGCCA GGTACCTGCG CAGGCTTCCG GACAGCCGTT GTTCGGGGTG
TTCCAAGCCT TCCGCGCCTG GAGCGTCGAG CAGACCTACG GTGCGCTGAC CGCCGCCGTC
CGCAAGATCA CCGCGAACAC GCCGCTGTAC TACTACTACG GCGGCAGCTA CGGGAACGTG
ACGAACTACG CCAACAACCC CGACAGCTTC TTCAAGCTCG CCAAGCAGTA CAACGTCACC
ATCATCGCCG ACTCGGCCAG CAACACCGGC ATGACGCTGG CGATGACGAG CCTCGGGCGC
GCCTACGGCG TGAAGGTCGC CGAGGAGTGG ACGGCGCCGA ATTCGGACTC TGAGTTGGCC
GCGTACGCCG TGCAGTGGCT CGACAGCTAC GGGATGACGT TCCCGCAAGC CGGCGGCGAG
GACTTCTTCA TCCACGACGG CACCTCGAAG GACACCGTCG GCTACCCGAT CTACACCAGC
TGGCTGCCGA CCCTGAAGAG CCTGTCGGGC ACCTACCCGC AGCAGCCCAC CGCGCTGTAC
ATCGACGTCT CGCAGGGCTA TGGCAACACC AACGGCGGCA GCCTGAACAC CGTGGAGAGC
CAGGCGGCGG CCATCTGGAA CAGCTTCCAG TCCGGACTGG CGGTCGTCAC CAGCCAAGAG
GTGGCGAACG GCGCGGTGAG CCTGTCCTCG TTCAACGCCG TGCTGCCGCT CAACGGGGTC
GATGCGAATC TGACCTCGTA CAAGAACGGC GGCGGCGCCC TGCTGACGTC CGCGGCGCAG
CTGACTCAGC ATGCGAGCGC CTACGCGGTG ATCGACGCGC CCTACGTCGG CGACGTGCAA
GCCGTGCCGG TCCTGGCGGC CAGTCACACC AGTGCCTCGC TGACCTTGGC GGACATCACC
ACCGGAACCG CCTACAACGC GCCGATCGCG ATCAACCCGG CCGGGCTCGG CCTGAACTCG
GGCAGCTACT ACGTCGTCAA CGCCGCCGGG ACAGCACTCC CCCAGACCGT CCAGTCGAAC
GGACAGATCT GCGTGAGCGC GAACCTCGGC GCGGCGAGCC TGGCCGAGTG GACCGTCAAG
GCCGGGCCGG TGCCCGCCGG GACCGCCTCG TCCGGCTGTC CGACCACGTA CACCGGAGCC
ACGTCGGTGA GCGCCACCGC CGGCCAGTCC GGCGGCGGGT TGACCTTCCT GGGCGTCGGC
GCGACGAACC AGGGCTCTGA CGGAAACCTG ACACAGATCA CCCAAGGCGG CCAGACCGCC
TATGAGACCT GGACGTCCGC GCAAAGCGGC GCGACCGGCT CGGCCGACGT CTACCTCCAG
GCGGCGCCGA TGTCCGCGGT CGAGGCGGCC GCGACCATCT CGATGCAGGT CACCTATTGG
GCGACCGCCG GTCAGGGCTT CACCGTGCAG TACAGCACGC CGACGAACAA GTACCAGAAC
GGACCGAGCG TCACCAGCCC GGGCACCGGG ACCTGGACCA CGGCGACCGT CCAGCTCACG
AACGCCCAAC TCGGCGAGTT GGAGAACGGA GGCGCCGACC TGCGGCTCGC CGTCGCCGAT
GTCACCACGC CGCTGATCGT GCGCAGCATC ACCATGTCGG CCGGGAACAG CAGCGCGCCG
GTCCTGGCCG CGACGCCGAG CTCGCTGTCG TTCGGCAGCG TGAGCACCGG TTCGACCAGC
GCGGCCCGCA CGGTGACCAT CACCAACTCC GGCAACGCCG CGGCGAGCGT TTCCAGCATC
TCGACGACCA GCGGCTTCGC CCAGACCAAC ACCTGCGGAT CCAGCATCGC CGCGGGAGCG
AGCTGCACGG CGAGCGTCAC CTTCTCCCCC ACCGCCGCTC AGACCTACAG CGGCAACCTG
ACGGTCACCA GCACCGCGAC CGGCAGCCCT CTGATAGTCG CGCTGTCCGG AACGGGCACG
AGTTCGAGCA CGAACCTCGC GCTCAACAAG CCGATCAGCG CCTCTACTGT CCAGCAGAAC
TACGTCCCCA CCAACGCCGT CGACGGCAAC ACGGGCACGT ACTGGGAGAG CCGGGACGGG
ACCTGGCCGA GCAGTCTGAC CGTGGATCTG GGTTCGACAC AGACGCTCAG CCACACGGTC
ATCGACCTGC CACCGCTGTC CGTCTGGCAG ACGCGGACCC AGACCCTGTC CGTCCTGGGC
TCGACCAACA ACTCCACCTG GACGACCATC GTCGCCTCAG CGGTCTACAC GTGGAATCCG
AGCACGGGCA ACACCGTGAC CATCACGTTC CCCGCCGGCA CGGCGTACCG GTACGTGCAG
CTGAACTTCA CGGCGAACAA CGTGCAGAAC GGCGCGCAGG TCTCCGAGTG GCAGCTCTTC
GGCTGA
 
Protein sequence
MRISRRTTGA LLAGALALAG LSTSAALTAA PAHAASAPTT PIWSTQLDFD NGGAAWSEPY 
FAALAAKGLT TAELNMPWGT IEPSAGTFSF TIWDQELANA AAAGIQLIPV FWQSGWGGSP
APWITDLEKT STGAAGVAPD WWNTTEQAQY FTYVENTIQN SIAQPGGYGG AVLDYGFLDA
QWDISGSGGG YASGDITEFQ NVYLPNAFGT IAAFNAAEGT SYTAFSQVPA QASGQPLFGV
FQAFRAWSVE QTYGALTAAV RKITANTPLY YYYGGSYGNV TNYANNPDSF FKLAKQYNVT
IIADSASNTG MTLAMTSLGR AYGVKVAEEW TAPNSDSELA AYAVQWLDSY GMTFPQAGGE
DFFIHDGTSK DTVGYPIYTS WLPTLKSLSG TYPQQPTALY IDVSQGYGNT NGGSLNTVES
QAAAIWNSFQ SGLAVVTSQE VANGAVSLSS FNAVLPLNGV DANLTSYKNG GGALLTSAAQ
LTQHASAYAV IDAPYVGDVQ AVPVLAASHT SASLTLADIT TGTAYNAPIA INPAGLGLNS
GSYYVVNAAG TALPQTVQSN GQICVSANLG AASLAEWTVK AGPVPAGTAS SGCPTTYTGA
TSVSATAGQS GGGLTFLGVG ATNQGSDGNL TQITQGGQTA YETWTSAQSG ATGSADVYLQ
AAPMSAVEAA ATISMQVTYW ATAGQGFTVQ YSTPTNKYQN GPSVTSPGTG TWTTATVQLT
NAQLGELENG GADLRLAVAD VTTPLIVRSI TMSAGNSSAP VLAATPSSLS FGSVSTGSTS
AARTVTITNS GNAAASVSSI STTSGFAQTN TCGSSIAAGA SCTASVTFSP TAAQTYSGNL
TVTSTATGSP LIVALSGTGT SSSTNLALNK PISASTVQQN YVPTNAVDGN TGTYWESRDG
TWPSSLTVDL GSTQTLSHTV IDLPPLSVWQ TRTQTLSVLG STNNSTWTTI VASAVYTWNP
STGNTVTITF PAGTAYRYVQ LNFTANNVQN GAQVSEWQLF G
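
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the codon-to-amino-acid assignments of translation table 11, which match the standard genetic code, and it ignores alternative start codons because this gene begins with ATG. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical; only the first 30 bases and the last 6 bases of the gene sequence above are used as input.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
# Translation table 11 uses the same amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


if __name__ == "__main__":
    # First three ten-base blocks of the gene sequence listed above.
    print(translate("ATGAGGATCA GTAGACGTAC AACCGGCGCC"))  # prints: MRISRRTTGA
    # Last six bases: one glycine codon followed by the TGA stop codon.
    print(translate("GGCTGA"))  # prints: G
```

The two fragments translate to the first ten residues and the final residue of the protein sequence above. Passing the full gene sequence to `gc_fraction` would give a value to compare against the listed GC content.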