Gene Caci_3717 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_3717 
Symbol 
ID8335070 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp4182383 
End bp4183693 
Gene Length1311 bp 
Protein Length436 aa 
Translation table11 
GC content68% 
IMG OID644956857 
ProductCarbohydrate-binding family V/XII 
Protein accessionYP_003114460 
Protein GI256392896 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.609198 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.0379459 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGATCG GAAAACGGAA CCTTCGCGCA GCACTGACAG CCGGCTCGGC GTTCGCGCTG 
GCAGCCGGCG CGGCCGTCCT CGCGGTGGCT GCGGCTCCCA GTGCACAGGC CGCCACGTGC
GCCGCGCCGT GGAGTGCCGC CACGGTCTAC ACCGGCGGCC AGCAGGCCAG CGAGAACGGG
ACCAACTACA CCGCCAACTG GTGGACGCAG GGCAACGACC CGGCGACCAA CAACGGCGGA
TCCGGCACCG GACAACCGTG GACGTCCAAC GGCGCGTGCA CCGGCGGTAC CGGCGGCGGC
ACAGGCGGCA CAGGCGGTGG CACCGGCGGC GGAACCGGCG GCACCGGTGG CGGAACGGGC
GGGGTGAGCG GCCTGCTGCT CAGCCCGTAC AAGGACGTCA CCGTCAACAT GAACTGGAAC
ACCTACCAGA TGCAGTCGGC GGTGACCGGG TCCGTCATAC CGGTAGTCGG TTCCGGAAGC
CTGGTGTCGC AGTACGTTCC GAAGCTGCCC GCGATCACCC TGGCGTTCGC CACCGGCTCC
TGCGGCAGCG AGACCTGGGG CGGCGTCCCG GCTGCCAACT TCGCCTCGGA GAACGTGGCC
CAACTGCACG CCGCCAACCT GAACTACGTC GTGTCGACCG GCGGCGCCGC CGGCAGTTTC
ACCTGCGCCT CGGCGTCCGG CATGAAGTCC TTCATCGCCC GCTACGCCAG CTCGAACCTG
GTCGGCATCG ACTTCGACAT CGAAGGCGGC CAGAGCGCGT CGGACATCCA GAACCTCGTC
GCCTCCGCGG TCGGAGCCCA GTCGCAGTAC CCGAACCTGC AGTTCTCCTT CACCCTGGCC
ACCCTCGGGG CCTCCGACGG CAGCTACGGC GGAGTGAACT CCCTCGGCAA CACGGTCGTG
CAGGCCGTCC GCGGCTCCAG CCTGAACCAC TACGTCATCA ACCTGATGAC CATGGACTAC
GGCAGCGCCT CCAGCAGCGT GTGCGTCGTC TCCGGCGGCA CCTGCCAGAT GGCCCAGTCG
GCGATCCAGG CGGTGAAGAA CCTCGAGCAC ACCTACGCAA TCCCGGCCAG CAAGATCGCC
GTCACCCCGA TGATCGGCAT GAACGACGCC ACCAGCGAGA TCTTCACCGT CGCCGACGTC
AACACCCTGT CGTCCTACGC CGTCAGCAAC GGCCTGGCCG GCCTTCACTA CTGGTCACTG
GACCGAGACA CCCCCTGCTC AAGCTCCTAC GCGTCCCCCA CCTGCAACTC CGTCCCCAGC
ACGACCCCGC TGCAATACAC CAAGCAGTTC ATGAGCGACA CCGGGCACTG A
 
Protein sequence
MRIGKRNLRA ALTAGSAFAL AAGAAVLAVA AAPSAQAATC AAPWSAATVY TGGQQASENG 
TNYTANWWTQ GNDPATNNGG SGTGQPWTSN GACTGGTGGG TGGTGGGTGG GTGGTGGGTG
GVSGLLLSPY KDVTVNMNWN TYQMQSAVTG SVIPVVGSGS LVSQYVPKLP AITLAFATGS
CGSETWGGVP AANFASENVA QLHAANLNYV VSTGGAAGSF TCASASGMKS FIARYASSNL
VGIDFDIEGG QSASDIQNLV ASAVGAQSQY PNLQFSFTLA TLGASDGSYG GVNSLGNTVV
QAVRGSSLNH YVINLMTMDY GSASSSVCVV SGGTCQMAQS AIQAVKNLEH TYAIPASKIA
VTPMIGMNDA TSEIFTVADV NTLSSYAVSN GLAGLHYWSL DRDTPCSSSY ASPTCNSVPS
TTPLQYTKQF MSDTGH