Gene Caci_4120 details


Gene Information

Locus tag: Caci_4120
Symbol:
ID: 8335474
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Catenulispora acidiphila DSM 44928
Kingdom: Bacteria
Replicon accession: NC_013131
Strand:
Start bp: 4661555
End bp: 4662754
Gene Length: 1200 bp
Protein Length: 399 aa
Translation table: 11
GC content: 69%
IMG OID: 644957223
Product: hypothetical protein
Protein accession: YP_003114825
Protein GI: 256393261
COG category:
COG ID:
TIGRFAM ID:
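
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 4661555
end_bp = 4662754

# Assumption: both coordinates are inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Each codon is 3 bp; the last codon is assumed to be the stop codon,
# which does not contribute an amino acid.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length)     # 1200
print(protein_length)  # 399
```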


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.29772
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 34
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGCAATG TGTTCACCGT GTGCTTCAGC GGAACATCGT GCACACGGGA CGAGGGAGAG 
GTATCACGTC CCGGCAGCGA CAAGGACATC TACGACCCGG CGACGGGCTA TATCCCGGTC
CGGATCCACA AGGAGCTCAC CGGCAGTCTG ACCGCGACCA GTCCGAGCGT GACGGTGCGC
GGCGTCGGCG AGAACGACTG GGCGGTGCCG CGCAACAACA GCGAACCACT GGTGTTGGAC
GGTCCGCTCA GGGCGCCAGC CGATCTGCTC AAGGACGCCA AGCCGTACTC CGGCGGCGAT
CAGCGTTCGA GGGTGTCCGC GCTGTCGGGG TGGGACGCCG CCGCGCTCGC GCTGCACGCC
GCGAACCTGG CGGCACGCAG CGGGGCGAAG GCGTTCAACT TCATCGGGCA CAGCCGCGGC
GCGGTGGAGT GCGTCATGGC CGCCTGGTTT TTGCAGGCGT ACGGCTCGCC GGAGGTGCAG
GCCGTCCCGG TGCGCATCCT GGCGATCGAC CCCGTTCCCG GACCGGGCAA CTGGTACGGG
ATCTTGACCC AGCTTCCGCC GAACGTCGTG GAGTACGTCG GCGTCACCGC CTGGGACATG
CTCGACACCG GCTTCGACGG CGTCGTGCCG CGGCCCAACG CCAAGATGGC GGGCACCTCG
CAGACCCTGA AGCTGGGCAC CGGCAGCTGG ACCAAGCTCG CCGACAACTA CCAGCTCACC
GATCCGCTGG CGCCGGCCAA GAGCGGCATG GGGCAGCCCA CCGGATACCG GCTGTTCGCC
TGCCGGGGAC GGCACGCGAC GGTCGCGGGG AACATGACCA GCAACGGTGA GTACAACGCC
GCGGACGTCA ACGCGAGCGC CGCGCGGGTG CCCGAGCTGG TCTACCGGCT CGCGCGCGCC
TACCTCACCA GTTGGGGCTC GGAGTTCAAG GTGAAGTCCG GGGTCGACAC CTACTCGCTG
CCGCTGCGGC AGCAGATCAA CCTCGACCAG GCGGTGTTCG ACAACATGGC CGGCGGACCG
CTGCGCGACA GCGTGCGGCC GGGGCGGCCG TACGTGCGTC AGGTCTCGTC GATATCCGGG
CGCAATCCGT TCAACACGTA CTACCTCGAA GACGTCGTGG GCGATCCGCC ATACCGGCAG
CCGTACCCCG TGACCGCCGC CCGCACGGGC GCCGGCTGGG TCGACTGGAC CTTTCTGTAG
 
Protein sequence
MGNVFTVCFS GTSCTRDEGE VSRPGSDKDI YDPATGYIPV RIHKELTGSL TATSPSVTVR 
GVGENDWAVP RNNSEPLVLD GPLRAPADLL KDAKPYSGGD QRSRVSALSG WDAAALALHA
ANLAARSGAK AFNFIGHSRG AVECVMAAWF LQAYGSPEVQ AVPVRILAID PVPGPGNWYG
ILTQLPPNVV EYVGVTAWDM LDTGFDGVVP RPNAKMAGTS QTLKLGTGSW TKLADNYQLT
DPLAPAKSGM GQPTGYRLFA CRGRHATVAG NMTSNGEYNA ADVNASAARV PELVYRLARA
YLTSWGSEFK VKSGVDTYSL PLRQQINLDQ AVFDNMAGGP LRDSVRPGRP YVRQVSSISG
RNPFNTYYLE DVVGDPPYRQ PYPVTAARTG AGWVDWTFL
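
The protein sequence is the translation of the gene sequence under translation table 11, and the GC content field is the fraction of G and C bases in the gene sequence. The sketch below shows one way to reproduce both from the text as listed here. It is not the database's own method: the helper names (`clean`, `gc_content`, `translate`) are hypothetical, and the codon table is written out by hand. Table 11 shares its amino acid assignments with the standard code and differs only in which start codons it permits, which does not matter here because the gene begins with ATG.

```python
from itertools import product

# Amino acid assignments in TCAG codon order (same for tables 1 and 11).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the 10-base display blocks."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, as displayed above.
first_block = "ATGGGCAATG TGTTCACCGT GTGCTTCAGC"
print(translate(first_block))  # MGNVFTVCFS
```

Passing the full gene sequence to `translate` and `gc_content` in the same way gives values to compare against the listed protein sequence and the GC content field.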