Gene Caci_4938 details

Gene Information

Locus tag: Caci_4938
Symbol:
ID: 8336292
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Catenulispora acidiphila DSM 44928
Kingdom: Bacteria
Replicon accession: NC_013131
Strand:
Start bp: 5636427
End bp: 5638103
Gene Length: 1677 bp
Protein Length: 558 aa
Translation table: 11
GC content: 68%
IMG OID: 644958037
Product: ABC-type sugar transport system periplasmic component
Protein accession: YP_003115639
Protein GI: 256394075
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
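
The coordinates and lengths listed above agree with one another, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end coordinates are inclusive and that the gene length includes one terminal stop codon. The function names are illustrative and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(5636427, 5638103)
print(length)                  # 1677
print(protein_length(length))  # 558
```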


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000971011
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 32
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCGAGCA CCAACGCCTT CACGCGCCGC GGATTCCTCG CGGCCTCCGC CGGCGCCGCC 
GGCGCGATCG GGCTGTCCCC GCTGCTGGCG GCCTGCGGCA ACAACGGCGG CAAGGGCGGG
GCGAGCACCA AGGCGGCGAT CCAGGCCGTG CTGCCGACGT ACAAGCCGCT GTCCGGCGGG
GTCACCCCCG ACATCCCGTC CGTGCCGGGC ACGAACGGCG CCATGACCGA CCCGGGTTTC
CTGAAGTACC CCAGCACCCT GCCGAAGACG GTGACCGGCC AGGTCGGCTC CGGCGGGAGG
TACGCCGGCG TCGCGCCGTC GTGGAACCCG GTCCCGCCGG CCGGCAACTC CTACTACGAG
GCGGTGAACA AGGCGCTCGG CGCGACCTTC GTCTCCCAGC CGGCCAACGG CAACACCTAC
AACACCGTCA TCCCGCCGCT GATCGCCGCG GACAAGCTGC CGGACTGGCT GTCCATCCCG
GGCTGGCTGA ACCCCACCTT CGACACCGGC GGCCTGGTCG GCACCAAGCT GGCCGACCTG
ACGACCTACC TCGGGGGCGA CGCGGTCCTG GAGTACCCGA ATCTCGCGGC CATCCCCAGC
GGCGGCTGGA AGTGCGGGAT CTGGAACAAC CGGCTCTACG GCATCCCGTC GCAGACCGAC
AGCCTGAGCT TCGCCGGCGC CATCTACTAC CGCAAGGACC TGCTGGACGC CAAGGGCATC
ACCCCGAACG TCAAGACCGC GCAGGACTTC GAGGCCCTCG GCCGGGAGAT CAACAACCCC
GGCGGCGGCG TGTGGGCGTT CGACGACATG CTGGTGTACC TCTACCAGGT CTTCAAGGTG
CCGCTGGGCG GGTGGTACCT GGAGAACGGC AAGATCAAGA ACGTCGGCGA GCACCCCGCC
ATGCTGGAGT GCCTGGCCTG GGCCAACAAG ATCGCCAAGG CCGGGTTGGT CCACCCCGAC
GCGATCGCCG GAGTGAACAC CAGCAACCCC AGCCGGTTCA TGGCCGGCAA GGTGTACATC
GAGGCCGGCG GCATGGCCGG CCTGAGCGGC CCGGACGCGA AGAACGGCAC CGCGGGCAAG
GCCGGCTACC AGCGCGCGCT GTTCCCGCTG TTCTCCTCCG ACGGTTCGAC CCCGAGTATC
GGCCTGGGCG GCTCCTCGGG CTGGATGAGC TATCTGAACA AGAATCTGAA CCCGGAGCAG
ATCAAGGAGT GCCTGCGGAT CGCGAACTTC TTCGCCGCGC CGTTCGGGTC CTTCGAGTAC
AACCTCCTCA ACTACGGAGT CGAAGGCGTC CACTACACGA TGGGCCCTGA AGGACCGGTG
TTCACCAAGG AGGGCTCCAA CACGGCGGCC GACGGCATAT TCGGCTTCTT CAGCACCGCT
CAGACCGCGG TCTACAACGC GGGGTACCCC GACGTCACCA AGGCCATGGA GGCTTGGTGC
GCCGACGCGG CCAAGCACGC CTACAAGCCG ATGTTCTGGA ACCTGAACAT CAGCGTGCCC
AGCCAGTTCT CCAAAACCGC CGCCCAGACC GAGTTGTGGG ACGCGACGCA GGCGGTGGCG
CACGGAAAGC AGCCGGTGTC GTACTACCAG GACGCGTACT CCCGGTGGAA GAGCGGCGGC
GGCGACGCCC TGGGGACCTG GTACCAGCAG AACCTTATTG ACAAGGGCCT CAGCTAG
 
Protein sequence
MSSTNAFTRR GFLAASAGAA GAIGLSPLLA ACGNNGGKGG ASTKAAIQAV LPTYKPLSGG 
VTPDIPSVPG TNGAMTDPGF LKYPSTLPKT VTGQVGSGGR YAGVAPSWNP VPPAGNSYYE
AVNKALGATF VSQPANGNTY NTVIPPLIAA DKLPDWLSIP GWLNPTFDTG GLVGTKLADL
TTYLGGDAVL EYPNLAAIPS GGWKCGIWNN RLYGIPSQTD SLSFAGAIYY RKDLLDAKGI
TPNVKTAQDF EALGREINNP GGGVWAFDDM LVYLYQVFKV PLGGWYLENG KIKNVGEHPA
MLECLAWANK IAKAGLVHPD AIAGVNTSNP SRFMAGKVYI EAGGMAGLSG PDAKNGTAGK
AGYQRALFPL FSSDGSTPSI GLGGSSGWMS YLNKNLNPEQ IKECLRIANF FAAPFGSFEY
NLLNYGVEGV HYTMGPEGPV FTKEGSNTAA DGIFGFFSTA QTAVYNAGYP DVTKAMEAWC
ADAAKHAYKP MFWNLNISVP SQFSKTAAQT ELWDATQAVA HGKQPVSYYQ DAYSRWKSGG
GDALGTWYQQ NLIDKGLS
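
The sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch of how such text could be normalized and its GC content computed, assuming plain Python with no external libraries. `clean_sequence` and `gc_fraction` are hypothetical helper names chosen for illustration.

```python
def clean_sequence(text: str) -> str:
    """Join whitespace-separated blocks into one uppercase string."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a non-empty DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# First two blocks of the gene sequence above, used as an example input.
fragment = clean_sequence("ATGTCGAGCA CCAACGCCTT")
print(len(fragment))          # 20
print(gc_fraction(fragment))  # 0.55
```

Applied to the full gene sequence, the result can be compared with the 68% GC content listed under Gene Information.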