Gene Caci_3839 details

Gene Information

Locus tag: Caci_3839
Symbol:
ID: 8335192
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Catenulispora acidiphila DSM 44928
Kingdom: Bacteria
Replicon accession: NC_013131
Strand:
Start bp: 4346506
End bp: 4347936
Gene Length: 1431 bp
Protein Length: 476 aa
Translation table: 11
GC content: 67%
IMG OID: 644956975
Product: protein of unknown function DUF1254
Protein accession: YP_003114578
Protein GI: 256393014
COG category: [S] Function unknown
COG ID: [COG5361] Uncharacterized conserved protein
TIGRFAM ID:
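
The start and end coordinates, the gene length, and the protein length listed above can be cross-checked with a little arithmetic. The sketch below assumes 1-based, inclusive coordinates and a single untranslated stop codon at the end of the CDS; the page does not state either convention, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 4346506
end_bp = 4347936

# Assumption: coordinates are 1-based and inclusive at both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1431 476
```

Under these assumptions the result agrees with the listed 1431 bp and 476 aa.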


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.190359
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value: 0.11168
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCACGCCT CCCGGCCTGA CGCTGTCGCT GATGCGGCGC CGCCGCTGGT GCGGCAGATC 
AACGACGGCA GGTGGCTGGA CCAGCGGGAG GCCGAGGAAC TGCGCTCCGA GTTGTTCTTC
CACCGGGCGG TCCACGCATA TCTGACGATG CTCCCGGCGC TGAACGTTAT CGGGATGCGG
GACGGGTCCG AGGGTGCGTT CGGTGCCGGG TATCACGTGC TGCCGGTGTG GAAGGACCGG
ATGGACAGCA GGACGTGGGT GCCGACCCCG AACGCGGACG TCATCTACTC GATGGGCTAC
CTGGACCTCG GGGAGACCGG GCCGTTGGTG GTGAACGCCC CGGCGAACGT GATCGGGATG
TTCACTGACT TCTTCCAGCG CACCATCACC GACGTCGGCG CGATCGGGCC GGACCGGGCG
CGCGGCGGGC TGTACCTGCT GCTGCCGCCC GGCTACGACG GCCATGTCCC GAACGGGTAC
TTCACGTTCC GGTCCTCCAC GTTCAACGTG TTCCTGTTCT TCCGCACGAT CATGGGCAAG
GGCGACGGCG GGCCGGATCC GTCGGTCGGC GCGGCCACGG CCGAGCGGAC CCGGATCTAT
CCGCTGTGGG AGGAGGAGAA GGACGTCCTG CCGATGCAGT TCCCGAACGC GAGCGGCGTC
CGGGTGAACA TGATGTACCC GACGGACTTC TCCTACTGGA CCATCTTGAA GGAGTTCGTC
GACTTCGAGC CCGTCGGCGC GATCGTTCCG GAACTGCGCG GCGTGCTGGC CTCGATCGGC
ATCGTCAAGG GCGAGCCGTT CGCCCCGAAC GCCTGGCAGC GCGAGCAGTT GGAACGTGCC
GTCCGGGTCG CTCCGCGGAT GACGCTCGCC CTGGCCCAAC TCGGCCGGGA CGACCAGCGC
AATCTCTACT ACACCGACCG GCAGTGGGAG CAGGCTTGGT GCGGCGGCAC CGCGGAGTGG
ATGCAGGCCA GCTACCTGGA CATCAACGCC CGCTCACGGT TCTTCCAGTA CGCCTATTCC
TCGGCCCCGG CGATGGTCGT GCATAGCACC GGCGCCGGCT CGAAATACCC GTACTCCGCC
CGCGACGCCG ACGGGGCGTT CCTGGAGGGC GCGAAGACCT ACCGGCTGCA CCTGCCGCCG
AACCCGCCGG CCGACCTGTT CTGGGCAGTG ACCGCCTACA ACATCACTGA CGGCACCATG
CCCGAGACCG AGCAGCTGCT GCCGTCCACG AACAGCTACT ACGACATCCC CAAGAACGAT
GACGAGTCGG TGGACGTCTG GTTCGGTCCG CGGAAGCCCG ACGGCGTCGC CGACCACGCT
TTCATCCAGA CCGTGCCCGA CCGGAACTTC GTTGTGGCGC TGCGCCTGTA CGGCACGGCG
CCGGCCTTCT ACGACCAGAC CTGGAAGCCG GACGACATCG TCAAGGCATG A
 
Protein sequence
MHASRPDAVA DAAPPLVRQI NDGRWLDQRE AEELRSELFF HRAVHAYLTM LPALNVIGMR 
DGSEGAFGAG YHVLPVWKDR MDSRTWVPTP NADVIYSMGY LDLGETGPLV VNAPANVIGM
FTDFFQRTIT DVGAIGPDRA RGGLYLLLPP GYDGHVPNGY FTFRSSTFNV FLFFRTIMGK
GDGGPDPSVG AATAERTRIY PLWEEEKDVL PMQFPNASGV RVNMMYPTDF SYWTILKEFV
DFEPVGAIVP ELRGVLASIG IVKGEPFAPN AWQREQLERA VRVAPRMTLA LAQLGRDDQR
NLYYTDRQWE QAWCGGTAEW MQASYLDINA RSRFFQYAYS SAPAMVVHST GAGSKYPYSA
RDADGAFLEG AKTYRLHLPP NPPADLFWAV TAYNITDGTM PETEQLLPST NSYYDIPKND
DESVDVWFGP RKPDGVADHA FIQTVPDRNF VVALRLYGTA PAFYDQTWKP DDIVKA
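
The relationship between the gene sequence, the GC content figure, and the protein sequence can be sketched in a few lines of Python. This is a minimal illustration, not the database's own pipeline: the helper names `clean_sequence`, `gc_content`, and `translate` are hypothetical, and the codon table is the standard one, which shares its codon-to-amino-acid assignments with translation table 11 (table 11 differs in the start codons it permits, which does not matter here because the gene begins with ATG).

```python
# Codon table in NCBI order (bases T, C, A, G). Translation table 11 uses
# the same codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text):
    """Drop the spaces and line breaks between the 10-character blocks."""
    return "".join(text.split()).upper()


def gc_content(dna):
    """Return the fraction of G and C bases in a DNA string."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence shown above.
first_blocks = clean_sequence("ATGCACGCCT CCCGGCCTGA CGCTGTCGCT")
print(translate(first_blocks))  # MHASRPDAVA
```

The translated fragment matches the first ten residues of the protein sequence. Passing the full gene sequence through `clean_sequence` and then `gc_content` or `translate` would allow the same comparison against the listed GC content and the complete protein.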