Gene Caci_1037 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_1037 
Symbol 
ID8332372 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp1178717 
End bp1179832 
Gene Length1116 bp 
Protein Length371 aa 
Translation table11 
GC content68% 
IMG OID644954185 
Producthypothetical protein 
Protein accessionYP_003111804 
Protein GI256390240 
COG category[S] Function unknown 
COG ID[COG4427] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.314959 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.0209911 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGCCG ATGACTTCCG TAGGTTCGCG ACGACGGCGG CCGGGACCTC GCCGCTGTAC 
GCCGCGCTGG CCGAGCAGGT TGCCGATGAT GGGCGGCTGC GGGAGGTGTC TGAGGCCGCT
GGCGATCCGT CGGTGGCGTT GTTCTTCGCT GCTGTGCAGC GGGTGCTGGC TGACCGTGGG
GACCATCCGT TGGCTGCCTA CTACCCGTCG TTCGGCGGCG ACCGCGCGCC GGATGCGGAG
CTGGCGGAGG CCTTTGAGGG CTTCGTGGTG GGACATCGTG ATCGGCTTGA GGCGTTGCTG
GTGACGGGAC ACGTCCAGAG CAACGAACCG TTGCGGGCCG CGCAGTTGCG GCCGGCGTTC
GGCTGGGCCC AGGCCGGGCT CGGGCGTGCG TTGGGTCTGA TCGAGGTCGG GACCAGTGCG
GGGCTCTTGT TGTATCCGGA GCGCTATGGC TACGTATACG AGTTCGGCGA CGGCTCGGTG
CTGGAACGGC TGCCCGCAGC GGATCCTGAC CAGCGCGACG ATGTTCCGGG ACCGGTGCTG
CGGTGTCTGG TGCGCGGCGC GGCGACTGCG AAGACGCTTG CCCCGTTCGT CAGCAAGGAG
CTGCGCGTTT CTTCGCGTGT CGGTATCGAC CTGAATCCGT TGAAGCCGGC CGATGCCGAG
ACCAGAGCGT GGCTGCGCGC GCAGGTCTGG CCGGAGGAAG CCGATCGCCT GGCGCGTTTG
GACGCGGCCC TGGCCATGGC GGCCCGGTAT CCGTTGCGGC TGCGCCAGGG CGATGTGCTC
GACATCCTTC CGGCGGCGAT CGGGATGGTG GCGGCTCCGT CCGTGCCGTG CGTCTTTCTC
TCCAACACGC TGGCGCACCT CACTGCCGAG GCTCGCACCT CGTTCGTCGA GATCATCAGG
GCCCTGGGAT CGAGTCGAGA TCTGGTGCTG ATCCTGAAGG AACCTGATGC GGTGGGCTTG
GGGCTGTTCG TTGAGCGGCC GGGCGGGGAT CCGTCTGCGG CGCGGGCCGA CTCGTTGGGT
GCCGTCCTCT ACCAGTCGGG TCGTGAGCGG TCCTTCTTGC TCGGCACGGC CGGATCGCGA
GGCGACTGGC TGGACTGGTC GCCTGCCATG CTCTGA
 
Protein sequence
MSADDFRRFA TTAAGTSPLY AALAEQVADD GRLREVSEAA GDPSVALFFA AVQRVLADRG 
DHPLAAYYPS FGGDRAPDAE LAEAFEGFVV GHRDRLEALL VTGHVQSNEP LRAAQLRPAF
GWAQAGLGRA LGLIEVGTSA GLLLYPERYG YVYEFGDGSV LERLPAADPD QRDDVPGPVL
RCLVRGAATA KTLAPFVSKE LRVSSRVGID LNPLKPADAE TRAWLRAQVW PEEADRLARL
DAALAMAARY PLRLRQGDVL DILPAAIGMV AAPSVPCVFL SNTLAHLTAE ARTSFVEIIR
ALGSSRDLVL ILKEPDAVGL GLFVERPGGD PSAARADSLG AVLYQSGRER SFLLGTAGSR
GDWLDWSPAM L