Gene Caci_3834 details


Gene Information

Locus tag: Caci_3834
Symbol:
ID: 8335187
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Catenulispora acidiphila DSM 44928
Kingdom: Bacteria
Replicon accession: NC_013131
Strand:
Start bp: 4341613
End bp: 4342812
Gene Length: 1200 bp
Protein Length: 399 aa
Translation table: 11
GC content: 70%
IMG OID: 644956971
Product: protein of unknown function DUF58
Protein accession: YP_003114574
Protein GI: 256393010
COG category: [R] General function prediction only
COG ID: [COG1721] Uncharacterized conserved protein (some members contain a von Willebrand factor type A (vWA) domain)
TIGRFAM ID:
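
The length fields follow from the coordinates by simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the final codon of the CDS is a stop codon; both assumptions are consistent with the values recorded here, but the record does not state them. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon (assumed)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(4341613, 4342812)
print(length, protein_length_aa(length))  # 1200 399
```

Both results agree with the Gene Length and Protein Length fields above.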


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00386989
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value: 0.0608669
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACCCGAT CGGGTGTGGC CGTCGCCTTC GCCGCCGCAC TGCTCGGAGT GCTCGGGGCG 
GTGAGCCGGT ACCGGGAACT GCTGCTCCTG GCCATCGGCT GCGCGACAAC ACTGGCGATC
GCGATCGCTT GGGCTGCCTC GAAGAGAACC AACCTGGTCG CGACCAGTGA GTACGCACCC
GCACGGCCCG AGGACGGGCA GCTGGTCGAG GCAACCGTGC ACGTACGGAA CCACGGGCGG
CGTACCAGCC GGCCGATGGT CGCCGTGGAG CAGGTGGGCG CCGACGCCTA CGGACTGGAA
ATCCCGGAAC TGGCTGCCGC ACAGAAACAC GACGGGACCT ACACGTTCGT CGCACCGCGA
CGCGGACAGC TGACGGTCAG CCGCGCGCCG GCAGCGAACA CCGATCCGAT CGGCTTGGTC
CGCAGAACCG AACTGGACGG TCAGGACACG CAGATCCGTG TGTATCCGCG CTGGCACAGC
GGAATCGCGC CGATCCTGGG CCCGGACGCG CGCGTCGGCC GCGGCACGGT CGGGGTACCC
CGCGGCGAGT ACGACTTCCA CTCGCTGCGC GACTACGAGC CCGGTGATCC GCTACGGCTC
ATCCACTGGC GCGCCACGGC GAAGCGCGGG GAGCCGCTGG TGCGGCGCCT GGAGGTGCCG
GACGAGGCCG AACAGCTGAT CGTGCTGGAC AACAGCGCAC TCTCGTTGAA CGCAGAGGAT
TTCGAGCACG CGGTGCGGGT CGCCGCCTCG CTGGCGGTCG CCGCGCGGCG AGCGGGCCTG
GCCTTGGAAC TGCGCACCGT GTGCGGGCCC GCGGTCGCGC GGCTGCGGCG CACGGGCCGG
TCCGCCAGCG CCACCGCCGC GATGGAGCTC CTGTGCGACG TCGAGCAGAT GCCGTTGAAA
CAGGGCCCTG ATCTGGCTGC CGTGCTGGCC GGCTTGGGAC GCGGCCGGGC AGACGTCGCG
CGACGCGGCG CGGCGTTCCG ATCGGAGAAC GCCGGCCCGG TCGTACTCGG CGTGGTGACC
GGCTTTCTCA GTACCCGTAC AGCGACGGCA CTGAGCCGGG CCCGGCAGAG GTTTGAGGCC
GCCTATGTCG TCCAGGTCGG CGAAAAGGTA CCTGTGACCC GTGTCAAGGA TGTCGAATGC
GTTCGCATCA AGACCAGTGA GGACCTTGTG GGGCAATGGA AGCGCCTGGT ACGCGGCTGA
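
The gene sequence is printed in groups of 10 bases, so the spacing has to be removed before the sequence can be used for any calculation such as GC content. A minimal sketch, assuming GC content means the percentage of G and C bases over the full sequence length; the helper names are hypothetical and only the first row of the sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Drop spaces and line breaks from a grouped sequence block and upper-case it."""
    return "".join(text.split()).upper()


def gc_percent(sequence: str) -> float:
    """Percentage of bases in the sequence that are G or C."""
    if not sequence:
        raise ValueError("empty sequence")
    gc = sum(1 for base in sequence if base in "GC")
    return 100.0 * gc / len(sequence)


# First row of the gene sequence only (60 bases), used as sample input.
first_row = "GTGACCCGAT CGGGTGTGGC CGTCGCCTTC GCCGCCGCAC TGCTCGGAGT GCTCGGGGCG"
seq = clean_sequence(first_row)
print(len(seq), gc_percent(seq))
```

Because the sample is a single row, its value need not match the GC content reported for the whole gene; passing all 20 rows to `clean_sequence` gives the full 1200 bp sequence.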
 
Protein sequence
MTRSGVAVAF AAALLGVLGA VSRYRELLLL AIGCATTLAI AIAWAASKRT NLVATSEYAP 
ARPEDGQLVE ATVHVRNHGR RTSRPMVAVE QVGADAYGLE IPELAAAQKH DGTYTFVAPR
RGQLTVSRAP AANTDPIGLV RRTELDGQDT QIRVYPRWHS GIAPILGPDA RVGRGTVGVP
RGEYDFHSLR DYEPGDPLRL IHWRATAKRG EPLVRRLEVP DEAEQLIVLD NSALSLNAED
FEHAVRVAAS LAVAARRAGL ALELRTVCGP AVARLRRTGR SASATAAMEL LCDVEQMPLK
QGPDLAAVLA GLGRGRADVA RRGAAFRSEN AGPVVLGVVT GFLSTRTATA LSRARQRFEA
AYVVQVGEKV PVTRVKDVEC VRIKTSEDLV GQWKRLVRG