Gene Caci_3884 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_3884 
Symbol 
ID8335237 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp4399376 
End bp4400575 
Gene Length1200 bp 
Protein Length399 aa 
Translation table11 
GC content67% 
IMG OID644957010 
Producthypothetical protein 
Protein accessionYP_003114613 
Protein GI256393049 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00861277 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.0141453 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCACGGAA CCGGAGTAAG GGACAAACAC CCGGCGCTTT TCGACCGGGT CTGCCGCGGC 
TTCGCGAAGC TGGATCGGCC CGATCTCGTC GTCGAGTCCT GCTTCTGGGG CGAAAAGCAC
GGTGCGAAGC TCGCCCTTGG CGGCGCATCC ATCCCCGCCT ATCTGAGCGC CGCACCCGAC
TACTCCGCCA TAGACGACCC CGATGAAGAA CTGGTCGGCT TCTGGGAGAT CCTCCAGGAG
GATCCCTTCG CCCAGCTCCA GCTTATGGCC GAGTTCGAGC AGCCGGGTCC TGACGACGAG
GTGCTGCCGC CCAGCTTCCA CACACCCGGG GCAGACGTTC TGTCCCGTCT CACCGCGCTG
AGTCCCGCCG GGGAACTCGC CGATCTGCTT GCCGAGACGG GCCTGGCCCC GTACCTGGAG
CCAGCCATGC GGCGGCTCCG CGCCGCGCCT GAGCTCGGCA AGGCCGCCGT GGCGCGCGCT
GAAGCGCCGG GAGACTTGGC CGAGGCGGTG GCCTGCGCGT TGCTGGCGTT GTTACTTACC
GTCGCTGATG AATCCGCGCC GACGGGCGCT CAGCTGGACC GACTGGCGGA CTTGCTGGTC
GCCGCGCTGG GCGCCGGGGA GCGATCCGGG CGGGTGTCGA AGACGGCACA GTTCCTCGGC
AGGCGTGCCT TGAACGTGAC GACGCAGCCG TTGCTGCGCG TCTTCCGGAA CGGGCTGACC
GAGGCAGCGG TCCCGATGCT CGGCGACATC CTGAACTACC AGGCCCACGG TGACGGTTTG
CGCGACTTCC TCAGAGCACG GATATTCGCC AGCGACGAGC ATACGATCGT CCTGGGCCAC
AGCCTCGGCG GAGTGGCCCT GGTCGACCTG CTTGCCGCCG CCACGCCAGG GGAATTCGGC
CAGGTCCGGC TGCTGGTGAC GGTGGGGTCG CAGGCTCCGT TCTTGTATGA GTTGGGCGCG
TTGCACAGTC TGCCGCTCGA CAGCGCCGAC ACGGCGGATG CGTTGGCTCG TATGCCCAAG
TGGCTCAACG TCTACGATCG CCGCGACCTG TTGTCCTATC TGGCTGCGCC GGTGTTCGGC
CCGGAGGGCG TCGAGGACTT CGAGGTCGAC TGCCGGCAGC CGTTCCCCGC GGCTCACAGC
GCGTATTGGA ACCTGGACCG CGTCTATGAA CGGATTTGCC AGGAGATCGC ATGGACTTAG
 
Protein sequence
MHGTGVRDKH PALFDRVCRG FAKLDRPDLV VESCFWGEKH GAKLALGGAS IPAYLSAAPD 
YSAIDDPDEE LVGFWEILQE DPFAQLQLMA EFEQPGPDDE VLPPSFHTPG ADVLSRLTAL
SPAGELADLL AETGLAPYLE PAMRRLRAAP ELGKAAVARA EAPGDLAEAV ACALLALLLT
VADESAPTGA QLDRLADLLV AALGAGERSG RVSKTAQFLG RRALNVTTQP LLRVFRNGLT
EAAVPMLGDI LNYQAHGDGL RDFLRARIFA SDEHTIVLGH SLGGVALVDL LAAATPGEFG
QVRLLVTVGS QAPFLYELGA LHSLPLDSAD TADALARMPK WLNVYDRRDL LSYLAAPVFG
PEGVEDFEVD CRQPFPAAHS AYWNLDRVYE RICQEIAWT