Gene Caci_5084 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_5084 
Symbol 
ID8336438 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp5841264 
End bp5842784 
Gene Length1521 bp 
Protein Length506 aa 
Translation table11 
GC content68% 
IMG OID644958183 
Producthypothetical protein 
Protein accessionYP_003115785 
Protein GI256394221 
COG category[S] Function unknown 
COG ID[COG3463] Predicted membrane protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.108071 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.201189 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCGAAG ATCTCGCGGA GCGCCCGGCC GCTCGCCGCT CTCCGCATGG CTATGTGCTC 
GCCGCGATGG TCGTTCTGGT CGCCGTGGCG TATGCGGTCT ATGAGCTGAC CGTCTACCGC
ACGTATCGGT CGTCCACCTA CGACTTGGTG ATATTCGATC AGGCGGTGCG CTCCTACAGC
CACTTCCATC TGCCGGTGGC GATCGTCAAG GGTGTGCACA ACAACTTCGG CGCGCAGTTC
ACCGTGCTCG GCGACCACTT CTCGCCGATC ATCGCCGTGC TCGCGCCGCT GTATTGGATT
TCCGGCAATC CGAAGACGCT GCTGGTCGCG CAGTCCGTGT TGCTGGCAGG CGCCATTCCG
TGGCTATGGG TCTACGCGCG CCGGGCTTCG GGGACGTTCG CAGCGTACTG CGTCGTCGTG
ATCTACGCGG TGTCGTGGCC GGTGGCTGCG GCGGTCGCCT TCGACTTCCA CGAGACCGCC
TTCGTGCCGC TGCTCAGCGC GGTGCTGCTG GAGCGGTATC AGGCCGGGCG GCGCGTGCAC
GCGGTGCTCG CCGCGTGCCT CCTGCTGCTG GTCAAGGAGG ACATCGGGCT GCTCGTCGCC
GGGTTCGGCC TCTTCTTGGT GACCGGTTGC CGGCTCCCGG CGCAGGAGAA GTACGCCCGC
CAGCGTCTGC TCGGCGCCGC GTTCGTGGTC GGCGGGGTCG GCTGGACGCT CGTGGCGACG
CACGTCTTCA TCCCCGCGTT CGGCGGACGC GGAAACTACT ACTGGGCCTA CACCGCGCTG
GGCCCGGACC TGCCGAGCGC CACGACGCAC GCGATCGCGC ACCCGGTCTC GACGCTCCAG
CTGTTCGGGA CGCCGTCCAT CAAGATCACA ACGATGACCT GGCTCGTCGT GCCGCTGCTG
CTCCTCCCCC TGGCCTCGCC GCTGACCCTG ATGGTCATTC CGGCACTGGC CGCCCGCATG
GGGTCGAACC TGTTCCCCAA CTGGTGGGGC GAGGAGTATC AGTACAATGC GGAGCTGATC
GTTCCGCTGG TGGCGGCAGG CCTGGACGGC GCACTGCGTA TCAACATGCT CCTCACGCGG
CTGCGCCCCA TCTGGACCTG GACCAAGCGC ATCGGTCCAG CATGGGCAGC CGGAGCCCTC
ATGGTGTCGG TCGCAGTCAT CCCGAAGTTC GCCTTCGACG CCTACGGCCA GTCATCCTTC
TACCGGCTGA CCCCTGCGGA CCGCGCCGCC GCGACAGCCG CCGGCCACGT CCCCGACGGA
GTGGTCGTCG AGGCGGCGAG CCTGATCGGC CCGCACCTGA GCGCGCGCGA CACGGTGCTG
CTGCTGGACA AGACCCCACG CTGGGCACCC TGGGTCGTCG CGCAAATCTC CGATCCGACG
TTCCCGATCT CGGACGCCGC CGCCCAGAAG GCGCGCGTCA CATATCTGGA GACGAACGGC
TATCGCCCCG TGTGGCACGA CGACTTCTAC GTCGTGCTGA CCAAGCCGGG ATCCGTGCCG
GACTACACCA GGCACGGGTA G
 
Protein sequence
MPEDLAERPA ARRSPHGYVL AAMVVLVAVA YAVYELTVYR TYRSSTYDLV IFDQAVRSYS 
HFHLPVAIVK GVHNNFGAQF TVLGDHFSPI IAVLAPLYWI SGNPKTLLVA QSVLLAGAIP
WLWVYARRAS GTFAAYCVVV IYAVSWPVAA AVAFDFHETA FVPLLSAVLL ERYQAGRRVH
AVLAACLLLL VKEDIGLLVA GFGLFLVTGC RLPAQEKYAR QRLLGAAFVV GGVGWTLVAT
HVFIPAFGGR GNYYWAYTAL GPDLPSATTH AIAHPVSTLQ LFGTPSIKIT TMTWLVVPLL
LLPLASPLTL MVIPALAARM GSNLFPNWWG EEYQYNAELI VPLVAAGLDG ALRINMLLTR
LRPIWTWTKR IGPAWAAGAL MVSVAVIPKF AFDAYGQSSF YRLTPADRAA ATAAGHVPDG
VVVEAASLIG PHLSARDTVL LLDKTPRWAP WVVAQISDPT FPISDAAAQK ARVTYLETNG
YRPVWHDDFY VVLTKPGSVP DYTRHG