Gene Sare_2033 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_2033 
Symbol 
ID5705687 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp2327112 
End bp2328248 
Gene Length1137 bp 
Protein Length378 aa 
Translation table11 
GC content71% 
IMG OID641271523 
Producthypothetical protein 
Protein accessionYP_001536894 
Protein GI159037641 
COG category[C] Energy production and conversion
[G] Carbohydrate transport and metabolism 
COG ID[COG1819] Glycosyl transferases, related to UDP-glucuronosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.0492847 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0634839 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCGTCC TGTTCGTCTC CTCCCCCGGT ATCGGTCACC TGTTCCCCCT GGTCCAGCTC 
GCCTGGAGCT TCCGCACGGC TGGCCACGAC GTGGTCGTCG CGCTGGCCGA ACACACCCAG
AAGGCCGCCG CCGCCGGTCT GGAGGTCGTG GACGTGGCCC CGGACTACAG CGCGGTCAAG
GTCTTCGAGC AGGTGGCCAA GGACAACCCG CGGTTCGCCG AGACGGTCGC CACCCGCCCC
GCCATCGACC TGGAGGAGTG GGGTGTGCAG ATCGCCGCAG TCAACCGGCC GTTGGTGGAC
CGCACCATCG CCCTCGCCGA CGACTTCACG CCCGACCTGG TCGTCTACGA GCAGGGCGCT
ACCGTCGGGC TGCTCGCCGC CGCGCGTGCC GGAGTACCCG CCATCCAGCG CAACCAGAGC
GCCTGGCGCA CCCGGGGCAT GCACACCTCG ATCGCCTCCT TCCTCACCGA CCTGATGGAG
AAGCACCAGG TCACCCTGCC CAAGCCGAGC GTGATGATCG AGTCGTTCCC GCCGAGCCTG
CTGCTGGAGG CAGAGCCGGA GGGCTGGTTC ATGCGTTGGG TGCCGTACGG CGGTGGGGCG
GTCCTCGGCG ACCGGCTGCC GGCGTCCCCA CCCCGCCCGG AGGTGGCCAT CACGATGGGC
ACCATCGAAC TCCAGGCGTT CGGTATCGGC GCGGTGGCGC CCGTCATCGC CGCCGCCGCC
GAGGTGGACG CCGACTTCGT ACTGGCGCTC GGCGACCTCG ACACCACACC GTTGGGCAAG
CTGCCGCCGA ACATACGTGC GGTCGGCTGG ACCCCGCTGC ACACGCTGCT GCGGACCTGC
ACCGCCGTGG TGCACCACGG CGGTGGCGGC ACGGTGATGA CCGCGATCGA CGCGGGTCTG
CCGCAGTTAC TCGCCCCCGA CCCCCGCGAC CAGTTCCAGC ACACCGCCCG GCAGGCGGTC
AGCCGACGCG GCATCGGCGT GGTGAGCACC GCCGACAAGG TCGACGCTGA CCTGCTGCGA
CGGCTCATCG GGGACGAGTC GATGCGCGCG GCAGTGCGGG AGGTTCGCGA GGAGATGCGG
GCGCTGCCCA CGCCGGCAGA GACGGTACGG CGTCTCGTGG AGTATGTCGC CGACTGA
 
Protein sequence
MRVLFVSSPG IGHLFPLVQL AWSFRTAGHD VVVALAEHTQ KAAAAGLEVV DVAPDYSAVK 
VFEQVAKDNP RFAETVATRP AIDLEEWGVQ IAAVNRPLVD RTIALADDFT PDLVVYEQGA
TVGLLAAARA GVPAIQRNQS AWRTRGMHTS IASFLTDLME KHQVTLPKPS VMIESFPPSL
LLEAEPEGWF MRWVPYGGGA VLGDRLPASP PRPEVAITMG TIELQAFGIG AVAPVIAAAA
EVDADFVLAL GDLDTTPLGK LPPNIRAVGW TPLHTLLRTC TAVVHHGGGG TVMTAIDAGL
PQLLAPDPRD QFQHTARQAV SRRGIGVVST ADKVDADLLR RLIGDESMRA AVREVREEMR
ALPTPAETVR RLVEYVAD