Gene Sare_3501 details


Gene Information

Locus tag: Sare_3501
Symbol:
ID: 5703310
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 4040415
End bp: 4041833
Gene Length: 1419 bp
Protein Length: 472 aa
Translation table: 11
GC content: 70%
IMG OID: 641272928
Product: Alpha,alpha-trehalose-phosphate synthase (UDP-forming)
Protein accession: YP_001538294
Protein GI: 159039041
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0380] Trehalose-6-phosphate synthase
TIGRFAM ID: [TIGR02400] alpha,alpha-trehalose-phosphate synthase [UDP-forming]
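
The coordinate and length fields in the table above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 4040415
END_BP = 4041833
GENE_LENGTH_BP = 1419
PROTEIN_LENGTH_AA = 472


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_length_aa(gene_length_bp: int) -> int:
    """Residues encoded by a CDS of this length, not counting the stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return gene_length_bp // 3 - 1


# Both checks agree with the values listed in the record.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(coding_length_aa(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 4041833 - 4040415 + 1 gives 1419 bp, and 1419 / 3 gives 473 codons, one of which is the stop codon, leaving 472 residues.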


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.519906
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00222704
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGCGACAGA GTTCCCTGGT GGTGGTCGCC AACCGCCTCC CCATCGATGA CAGTACGGCG 
CCGGACGGTG CCTGCGAATG GCGCCGCAGG CCCGGCGGCC TGGTGACCGC CCTACACTCG
CTGCTACGGC AGGCACCCGC CACCTGGGTG GGCTGGGCGG GTGGCACCGG GCCGGCCCCG
ACACTGCCCG ACGTCGACGG CGTCCGCATG CACACGGTGC CGCTCACCGT CGACGACCTC
CGCGACCACT ACGAGGGCTT CGCCAACGCC ACCCTCTGGC CGCTCTACCA CGACGCCGTG
GAGCAGCCGG AGCACCACCG CCGGTGGTGG GAGGCGTACC AACGGGTCAA CCAACGGTTC
GCGGCGGCGA CCGCCGACGT GGCCGAGACC GGCGCGGTGG TCTGGGTGCA GGACTACCAC
CTGCAGCTCG TACCCGGCCT GCTCCGGGCA CTGCGCCCGG ACCTGCGGAT CGGCTTCTTC
CTCCACGTGG CGTTCCCACC ACCCGAGCTG TTCATGCAGC TTCCCCGGCG GGCCGAGTTG
CTCCGCGGGA TACTCGGCGC GGACCTCGTC GGCTTCCAGC GGGCCCAGGC GGCGCACAAC
TTCGCCCAAC TCGCCGTCCG GGTGCTCGGG CTGCCGGCCA CCGACCGCCA GATCGTCGTG
GACGACCGAG TGGTCCGCAT CGGCTCGTTC CCCGTCTCCA TCGACAGCGC CGAAATGGCG
GCCCTGGCCA ACCGAGCCGA TGTCGCCGAC CGAGCCAACC GACTCCGCCG TGACCTGGGC
AGCCCGGAAC AGGTGATCCT CAGCGTCGAC CGGATGGACT ACACCAAGGG CATCGAGCAG
CGGCTGAAGG CGTACAGCGA GCTGATCTCC GACGGCCACG TCAAGGTACG AGACACCGTC
CTGGTCCAGG TGGCGGTGCC CAGCCGCGAG CGGGTCGGGC AATACCAGAT CCTCCGCGAA
CGGGTCGAAC GTGAGGTTGG CCGCATCAAC GGCGAATTCG GTCGCGTCGG CGAACCGGCC
ATCCACTACC TGACCCGACC CTTCGACCGC GCCGAACTGG CCGCGCTCTA CCGGGTCGCC
GACGTGATGG CGGTGACCCC ACTGCGGGAC GGCATGAACC TGGTGGCCAA GGAATACGTA
GCCGCTCGGG TCGACGACAC CGGTGCGCTG CTGCTCAGCG AGTTCGCCGG CGCCGGGGCG
GAGCTGTCCC AGGCGTATCT GGTGAACCCG CATGATCTGG AAGGTCTCAA GCAGGGTCTT
CTCGCGGCGC TGCGGGCCCG GCCGGACCAC GTCCGCAAAC GGATGCGGGC GATGCGGGCG
CACCTGCGCA AGCACGACAT CCACGCATGG GCGCGCTCCT ACCTTGCCGC CCTCGACGAC
AACGGCTCGC TGCTCAGCCG ACTCGGTACG ACCCGCTGA
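
A GC content figure such as the one reported for this gene can be recomputed from the nucleotide listing. The record does not say how its value was derived, so the sketch below assumes the usual definition: the number of G and C bases divided by the total number of bases, as a percentage. The function name `gc_percent` is hypothetical. Whitespace is ignored so that the listing can be pasted in with its ten-base grouping intact.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C among the A/C/G/T characters in `sequence`.

    Spaces, newlines, and any other characters are skipped.
    """
    bases = [b for b in sequence.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    gc_count = sum(1 for b in bases if b in "GC")
    return 100.0 * gc_count / len(bases)


# The first ten-base block of the gene sequence contains 3 G and 2 C.
print(gc_percent("ATGCGACAGA"))
```

Passing the full gene sequence instead of a single block gives the value to compare against the GC content listed in the record.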
 
Protein sequence
MRQSSLVVVA NRLPIDDSTA PDGACEWRRR PGGLVTALHS LLRQAPATWV GWAGGTGPAP 
TLPDVDGVRM HTVPLTVDDL RDHYEGFANA TLWPLYHDAV EQPEHHRRWW EAYQRVNQRF
AAATADVAET GAVVWVQDYH LQLVPGLLRA LRPDLRIGFF LHVAFPPPEL FMQLPRRAEL
LRGILGADLV GFQRAQAAHN FAQLAVRVLG LPATDRQIVV DDRVVRIGSF PVSIDSAEMA
ALANRADVAD RANRLRRDLG SPEQVILSVD RMDYTKGIEQ RLKAYSELIS DGHVKVRDTV
LVQVAVPSRE RVGQYQILRE RVEREVGRIN GEFGRVGEPA IHYLTRPFDR AELAALYRVA
DVMAVTPLRD GMNLVAKEYV AARVDDTGAL LLSEFAGAGA ELSQAYLVNP HDLEGLKQGL
LAALRARPDH VRKRMRAMRA HLRKHDIHAW ARSYLAALDD NGSLLSRLGT TR
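
The protein listing can be cross-checked against the gene sequence by translating it codon by codon. The following is a minimal sketch, not the database's own method. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code; the two differ in which codons may act as starts, which this sketch does not model. That is harmless here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Amino acids in the conventional TCAG codon order (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 0, stopping at the first stop codon.

    Whitespace is removed first, so the ten-base grouping is accepted as-is.
    Any trailing partial codon is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence listing (60 bases, 20 codons).
first_row = "ATGCGACAGA GTTCCCTGGT GGTGGTCGCC AACCGCCTCC CCATCGATGA CAGTACGGCG"
print(translate(first_row))
```

For the first row this prints MRQSSLVVVANRLPIDDSTA, which matches the first two ten-residue blocks of the protein sequence. The final codons of the gene, ACC CGC TGA, translate to T and R followed by a stop, in agreement with the protein ending in TR.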