Gene Sare_2112 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_2112 
Symbol 
ID5704966 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp2432930 
End bp2434333 
Gene Length1404 bp 
Protein Length467 aa 
Translation table11 
GC content70% 
IMG OID641271597 
Producthypothetical protein 
Protein accessionYP_001536968 
Protein GI159037715 
COG category[C] Energy production and conversion
[G] Carbohydrate transport and metabolism 
COG ID[COG1819] Glycosyl transferases, related to UDP-glucuronosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0819565 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0120875 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCATCG TGATCGCCGT CCAGCCCTCG CATGCCCACC TCTGGCCCAT CACGCCGGTA 
GCGTGGGCGT TGCAGAGCGC GGGCCACGAG GTACGCGTCG CCACCCACGC CCGGTTCGCC
GATTCGGTCC GGGCCGCCGG ACTGACCCCG GTCGGTCTCG GTGACCCGGC GGCCGACGAG
GCCCGTACCC GCTCGGACGC GCGTCCCCCG GCCCGGCCCG AGGAGGTGCT GCGGTACGCG
GACGTACTCG GCCTCGACGA GCAGGGACGG GAGCACTGGA TCGCCTTCTA TCAGTGGCTG
CTGAACCCCA TCTCGGACTA CATCCGGGCC GACCTGCCGT ACGCGGTCGA CCTCGTGGAC
TTCGCCCGGG CCTGGCGGCC CGACCTGGTG ATCTGGGACG CGACGATGGC GGCGGCGTCG
ATGGCAGCCC GGGTCAGCGG CGCGGCGCAC GCCCGATTCA CCCTCAACCT GGACTATCCG
GGCTGGTGCT TCGACCGGCT GCGGGAGCGT CGGGCCGAAC TGCGTGCGGC GGGTCTGTCC
GAGAACCCGG TGGCCGACCT GCTCCAGCCA TTGGCCGACA AGTACGGCAT CGAGGTCGAC
GACGAGATCC TGTACGGGCA GTGGACCATC GACCCGATGC CGACCGGGAT GAGCCTGCCG
ACCAGCGCCA CGGTCCTACC GGTACGGTAC GTGCCGTTCA CCGGGGCGGA CCTGATACCG
GAGTGGCTGC GCGGCGCACC ACAGCGGCCC CGGGTGGCGT TGACGCTGGG CGAGTCGACG
CGTCGGTTCA TCAAGGGCGA CTGGGGCCGC ACCCCGAAGA TCCTGGAAGC GGTGGCGGAC
CTCGATATCG AGGTGGTCGC CACGCTCAAC GCCCAGCAAC TGGAGGGTGT CGAGCAGGTC
CCCGACAATG TGCGGGCGCT CGAGTGGGTG TCGCTGACCC AGCTCATGCC CACCTGCTCG
GCGGTCATCC ACCACGGCGG CGGCGGGACA TTCGCCGCAC CGGTGGCCTT CAACCTGCCG
CAGATCGTCT GCGACACCGA CGAGTCGTTG ATGATGCAGC CGGTCGAGGT CGACCCGCGG
ACGATGGCCG ACGGCACCTA CCGGGTCGGA TTCGAGTTCG GCGTCAGCGA GGAGGTGGTC
CAGACGGTGA CCACCTGGCA ACTGCCGGGG AAGAAGTTGG AGGCGACGCC GACGGCGGAC
TACGTGGTAC GCCGGGGTGC CGGCGTCCGC CTCGACCACT ACGAGAAGTC GGTCGAGGAG
GTCCGGACAA TGATCCAAGA CGTGGTGCGT GAGCCGTCGT ACCGCGACGG TGCCCGGGCG
ATTTTCGACA CCTGGCTGGC CATGCCGAGC CCCGCTGACA TCGTCCCGCT ACTGGAACGA
CTCGCGGGGG AGCACCGTCG TTAG
 
Protein sequence
MRIVIAVQPS HAHLWPITPV AWALQSAGHE VRVATHARFA DSVRAAGLTP VGLGDPAADE 
ARTRSDARPP ARPEEVLRYA DVLGLDEQGR EHWIAFYQWL LNPISDYIRA DLPYAVDLVD
FARAWRPDLV IWDATMAAAS MAARVSGAAH ARFTLNLDYP GWCFDRLRER RAELRAAGLS
ENPVADLLQP LADKYGIEVD DEILYGQWTI DPMPTGMSLP TSATVLPVRY VPFTGADLIP
EWLRGAPQRP RVALTLGEST RRFIKGDWGR TPKILEAVAD LDIEVVATLN AQQLEGVEQV
PDNVRALEWV SLTQLMPTCS AVIHHGGGGT FAAPVAFNLP QIVCDTDESL MMQPVEVDPR
TMADGTYRVG FEFGVSEEVV QTVTTWQLPG KKLEATPTAD YVVRRGAGVR LDHYEKSVEE
VRTMIQDVVR EPSYRDGARA IFDTWLAMPS PADIVPLLER LAGEHRR