Gene Caci_4770 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_4770 
Symbol 
ID8336124 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp5431377 
End bp5432528 
Gene Length1152 bp 
Protein Length383 aa 
Translation table11 
GC content75% 
IMG OID644957870 
ProductSel1 domain protein repeat-containing protein 
Protein accessionYP_003115472 
Protein GI256393908 
COG category[R] General function prediction only 
COG ID[COG0790] FOG: TPR repeat, SEL1 subfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCCGAGG TAGAGCAGCA GATCGGCGGC GGGGCGAATG AGGCCGACGG GCTCCAGGTC 
GGCGGCGCCG ATGCGGTGAC GGAACTGGAG ACCCAGGCGC CGGCGGCGGC GTTCGACGCC
GACCCGGTGG TGGCCGCGGG CTTCGTGGCC TTCGAGGCCG GGAACCTGGA GGCCGCCGAG
GCGGTGTTCG GGCCGGCGGC GCAAAGCGGC GGCCGCGCCG CGATGTTCGG GCTGGGCCTG
GTGTATCAGG CCCGTGACGA CTTCGACCAG GCGGTCCGCT GGTACGCCGC GGCCGCCGAA
CTCGGCGAGG CCGAGGCGGC GAACAACCTG GGCACCCTGC TCGCGGTGCG CGGCGAGAAC
GACGCCGCGG TGGCCTGGCT CAGCCAGGCG GTGGCGCTCG GCTCCTCCGA GGCCGCGGTC
AACCTGGGCC GCATGAACCA CTGGGACAAC CCGGCCGAGG CCGAGTTCTG GTACCGCCGC
GGCGCCGAGG CCGAGGTCGG CTCGGCGATG GCGAACATCG GCGTCCTGGC CCAGCGCCGC
GGCGACCTGG CCCAGGCCGC CGAGTGGTAC CGCAAGGCGG TCCAGGCCGG CGAGGTCCGC
GCGGTGAACA ACCTCGGCAA CGTCCTGCTG ATGCAGGGCG AGGTCGAGGA GGCGATGGAC
TGCTTCAAGC GCGGTGCCGA GGACGGCGAC GGCCACGCGC AGGTGAACTT CGCCCTGCTG
TCCCTGGCTC GCGGCGAGGT CGAGGAGGCC CTTTCAGCCG CCGAACGCGG CCGCCGATCC
GACGCCCCGG GCGCCGAGGT CGTCTACGGC CAGGTCCTGC TCGCCACAGG CGACCAGCAG
GGCGCCGCCG AAGCCTTCCA GCGCGCGATG AACGCCGGCG ACCCCTCCGG CGCCTACGCC
CTCGGCGTGG TCGCCGCCCA GACCGGCGCC CTGATCGACG CCGAGCGCCT CTTCCTCGCC
GCGGCCAACG CCGGCCACGT CCCCGCGATG TGGAACTCGG CCGTCCTGCT GCAGCAGAGC
GGCAAGACCG AGGCGGCACT GCCGTGGTTC GAGGCGGCGG CCGCCGCCGG ACACCCGGAG
GCCCAGGCGG TGCTGTCGGG CGCCATCTCG GTGCAGCCGG GCGGGGACGC CGGAGGCGTC
GCCACGGCTT GA
 
Protein sequence
MAEVEQQIGG GANEADGLQV GGADAVTELE TQAPAAAFDA DPVVAAGFVA FEAGNLEAAE 
AVFGPAAQSG GRAAMFGLGL VYQARDDFDQ AVRWYAAAAE LGEAEAANNL GTLLAVRGEN
DAAVAWLSQA VALGSSEAAV NLGRMNHWDN PAEAEFWYRR GAEAEVGSAM ANIGVLAQRR
GDLAQAAEWY RKAVQAGEVR AVNNLGNVLL MQGEVEEAMD CFKRGAEDGD GHAQVNFALL
SLARGEVEEA LSAAERGRRS DAPGAEVVYG QVLLATGDQQ GAAEAFQRAM NAGDPSGAYA
LGVVAAQTGA LIDAERLFLA AANAGHVPAM WNSAVLLQQS GKTEAALPWF EAAAAAGHPE
AQAVLSGAIS VQPGGDAGGV ATA