Gene Caci_0044 details

Gene Information

Locus tag: Caci_0044
Symbol:
ID: 8331369
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Catenulispora acidiphila DSM 44928
Kingdom: Bacteria
Replicon accession: NC_013131
Strand:
Start bp: 45480
End bp: 46550
Gene Length: 1071 bp
Protein Length: 356 aa
Translation table: 11
GC content: 70%
IMG OID: 644953211
Product: Rhomboid family protein
Protein accession: YP_003110840
Protein GI: 256389276
COG category: [R] General function prediction only
COG ID: [COG0705] Uncharacterized membrane protein (homolog of Drosophila rhomboid)
TIGRFAM ID:
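
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the source database.

```python
# Values copied from the gene record.
START_BP = 45480
END_BP = 46550
REPORTED_GENE_LENGTH = 1071      # bp
REPORTED_PROTEIN_LENGTH = 356    # aa


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length // 3 - 1


gene_length = span_length(START_BP, END_BP)
protein_length = coding_codons(gene_length)

print(gene_length, protein_length)  # 1071 356
```

Under these assumptions both computed values agree with the Gene Length and Protein Length fields of the record.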


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 37
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCGCCAG GCAGCCCAGG CAGCCCCTCC GTGCCAGCCA CCGAACTCCC GGCGTGCTAC 
CGGCATCCGG GCCGCGAGGC GCAGATCCGC TGCACCCGCT GCGATCGCCG GATCTGCCCG
GAGTGCATGG TCCCGGCCTC GGTCGGGTTC CAGTGCCCGG AGTGCGTGCG CGGCGGCAAC
CAGGAGGTGC GCAAGGCGCG GTCCCCGTTC GGGGCGGTGC TGCGCCCCCG GGTGGTCCCG
GTCGTCACCT ACAGCCTGAT CGCGCTGAAC TTCGTGATGT TCGGCCTGCA GCACATCGTC
GGCACCTCGC AGGTCGGCGC GGCCGGCGGC GGGGTGTTCC AGGTGAACAC CCTGGACATG
CGGCTGGAGC TGATCGCCAA GGGCACCTGG GTCGACGGCC AGCCGATAGG CGTGGCCAAC
GGCGAGTGGT ACCGCCTGGT CACCTCGATG TTCCTGCACG CGAACCTGAT CCACATCGCC
TCGAACATGA TCTCGCTGTT CTTCATCGGC CCGATGCTGG AGGCGATGCT CGGCCGGCTG
CGGTTCGTGC TGGTCTACCT GATCGGCGGC CTGGCCGGGG CGGTCACGTC CTACTGGTTC
ATGACCCCGC TGAGCCCGGC GAGCCTGGGC GCCTCGGGCG CCATCTCGGC GGTCTTCGGC
TGCCTGGTGG TGATCGGGCT GCGGCGCAAG ATCCTGGACC CCGGGATGAT CGCCGTGGTG
CTGGTGATCA ACATCGTGAT CCCGCTGCAG AACACCAACA TCGACTGGCG CGACCATGTC
GGCGGCGTGG TGGCCGGGGC GCTGATCGGC GCGGTCTACG CCTTCGCCCC GGAGCTCATC
GGCGCGCTCG GCAAGGCCAG GGCGCCACGC GAGCAGCAGG TACGGCTGCT CAACTGGCTC
GGCTTCGGCA CGATGGCGCT GGTCCTGGCC CTGGCGATCG GCGGCACGGC CGTGCACACC
GCCCACCTGA ACGACCCGGC GAACCGGACG CGCACCGTCG ACGGCGCCGT GTACTCACCC
GGCCCGACCA GGGTCGTCAC CGACGTTCCG ACAAGTTATC CACAGGCCTG A
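
The gene sequence is printed in blocks of 10 bases separated by spaces, so it needs to be cleaned before any statistic such as the GC content reported in the record can be recomputed. The sketch below shows one way to do that. The helper names are hypothetical, and only the first line of the sequence is pasted in for illustration; the full sequence would be handled the same way.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks and convert to upper case."""
    return "".join(text.split()).upper()


def gc_percent(text: str) -> float:
    """Percentage of G and C bases in a (possibly block-formatted) sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "ATGCCGCCAG GCAGCCCAGG CAGCCCCTCC GTGCCAGCCA CCGAACTCCC GGCGTGCTAC"

print(len(clean_sequence(first_line)))   # number of bases on that line
print(round(gc_percent(first_line), 1))  # GC percentage of that line only
```

Running `gc_percent` on all of the sequence lines joined together gives a value that can be compared against the GC content field in the Gene Information section.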
 
Protein sequence
MPPGSPGSPS VPATELPACY RHPGREAQIR CTRCDRRICP ECMVPASVGF QCPECVRGGN 
QEVRKARSPF GAVLRPRVVP VVTYSLIALN FVMFGLQHIV GTSQVGAAGG GVFQVNTLDM
RLELIAKGTW VDGQPIGVAN GEWYRLVTSM FLHANLIHIA SNMISLFFIG PMLEAMLGRL
RFVLVYLIGG LAGAVTSYWF MTPLSPASLG ASGAISAVFG CLVVIGLRRK ILDPGMIAVV
LVINIVIPLQ NTNIDWRDHV GGVVAGALIG AVYAFAPELI GALGKARAPR EQQVRLLNWL
GFGTMALVLA LAIGGTAVHT AHLNDPANRT RTVDGAVYSP GPTRVVTDVP TSYPQA
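
The protein sequence corresponds to a translation of the gene sequence with translation table 11. A minimal sketch of that translation follows, assuming the standard amino-acid assignments (which table 11 shares with table 1; the two differ in their permitted start codons, which this sketch does not model). The names `CODON_TABLE` and `translate` are hypothetical, and this is not the database's own pipeline.

```python
from itertools import product

# Amino acids in NCBI codon order (bases T, C, A, G at each position).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(text: str, to_stop: bool = True) -> str:
    """Translate a block-formatted DNA sequence in reading frame 1."""
    seq = "".join(text.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a full codon.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, copied from the record.
gene_start = "ATGCCGCCAG GCAGCCCAGG CAGCCCCTCC"
print(translate(gene_start))  # MPPGSPGSPS
```

The output matches the first ten residues of the protein sequence above. Translating the last 21 bases (`ACAAGTTATC CACAGGCCTG A`) in the same way gives `TSYPQA` and stops at the terminal TGA codon.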