Gene Caci_3090 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_3090 
Symbol 
ID8334442 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp3395204 
End bp3396211 
Gene Length1008 bp 
Protein Length335 aa 
Translation table11 
GC content72% 
IMG OID644956237 
Productprotein of unknown function DUF35 
Protein accessionYP_003113840 
Protein GI256392276 
COG category[R] General function prediction only 
COG ID[COG1545] Predicted nucleic-acid-binding protein containing a Zn-ribbon 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.736409 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value0.697465 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGACGC AGAACACGCA GAACGCGCAG ACACCCGATT CCGACGACCG CGCCTTGCAC 
GTCCTGGAGT TCCCCGGCGG ATACACGCGC TCCACCGGCC CGGTGATCGG GCGTTTCCTG
ACCGGTCTGC GCGACGGGCA GCTGCTCGGC GTACGGACTC CCGACGGCAA GGTGCTGGTC
CCGCCGACCG AATACGATCC GCAGACCGCG GCCGCCCTTG GCGACGCGGA GGAGGACTGG
GTCCAGGTCG GTCCGGCCGG GACCGTGACC AGCTGGACGT GGGTCGACGC CCCGCGCGCG
GACCATCCGA TGAACCGCCC TTTCGCCTGG GCGCTGATCA AGCCCGACGG CGCGGACACG
GCGCTGCTGC ACGCCGTGGA CTCCGGGTCG AAGGCGGCGA TGGCGACCGG GATGCGGGTG
CATCCGTCGT GGCGGGCTGA GCGCAGCGGT TCGATCAAGG ACATCGCGTT CTTCGCTCCG
GGGGAGGGAC CGGCTGAGGT GCCGGCGCTG GCAAGCGAGG CAGCGCTCGA ACCCGTCTCG
GTGGTCACGC TGCCGCACCG GCTGGAGTAC CGGCTGCGCC CCGGGACGGT CTGGAACCAC
TTCATCGACG GCATGGCCGA GGGTCAGATC CGCGCGACGC GGTGCCCGGC GTGCGGCAAG
GTCTACGTGC CGCCGCGCGG CGCGTGCCCG GCGGACGGAC TGCCCGCGAC CGAGTGGGTG
GACCTGCCGG ACACCGGGGT GCTGACCACG TTCGCGGTCA ACAACGTCCC GGCGGCCGGC
GCGCCCGAGG TGCCGTTCAT CAGCGGCTAC GTGCTGCTGG ACGGCGCCGA CATCGCGATG
CTCGCGCTGG TCTCCGACGT GCCGTGGCAG GAGGTGCGGA TCGGGATGCG GGTGCGGGCG
GTGTGGGTGC CGGACGCCGA GCGGACGCGG TCGGTGAAGA ACCTGAAGTG GTTCGCGCCG
ACCGGCGAAC CCGACGTCCC CTTCGAGCGC TTTGAGGAGT ACGTGTGA
 
Protein sequence
MTTQNTQNAQ TPDSDDRALH VLEFPGGYTR STGPVIGRFL TGLRDGQLLG VRTPDGKVLV 
PPTEYDPQTA AALGDAEEDW VQVGPAGTVT SWTWVDAPRA DHPMNRPFAW ALIKPDGADT
ALLHAVDSGS KAAMATGMRV HPSWRAERSG SIKDIAFFAP GEGPAEVPAL ASEAALEPVS
VVTLPHRLEY RLRPGTVWNH FIDGMAEGQI RATRCPACGK VYVPPRGACP ADGLPATEWV
DLPDTGVLTT FAVNNVPAAG APEVPFISGY VLLDGADIAM LALVSDVPWQ EVRIGMRVRA
VWVPDAERTR SVKNLKWFAP TGEPDVPFER FEEYV