Gene Sare_2257 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_2257 
Symbol 
ID5706743 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp2595463 
End bp2596797 
Gene Length1335 bp 
Protein Length444 aa 
Translation table11 
GC content71% 
IMG OID641271736 
Producthypothetical protein 
Protein accessionYP_001537107 
Protein GI159037854 
COG category[S] Function unknown 
COG ID[COG1262] Uncharacterized conserved protein 
TIGRFAM ID[TIGR03440] conserved hypothetical protein TIGR03440 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.689177 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0943468 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACGGAGA CGACCAACCG GCCCGACGGT GTACGGCTGC GGGAGCACAT AGCGACGGAG 
CTGGCCCGCG CCCGCGCCCG TACCGCCGTG TTGACCGACG CGGTTGACGA CGCCGACCTG
GTACGACAGC ACTCGCCCCT GATGTCGCCC CTGGTGTGGG ACCTCGCCCA CGTCGGCAAC
CAGGAGGAGC TCTGGCTGGT GCGGGACGTC GGCGGCCGGG AGCCGGTTCG CCAGGACATC
GACGACCTTT ACGATGCGTT CAAGCAGCCC CGCCGGGATC GCCCGGCATT GCCGCTGCTG
CCGCCGCCGG AGGCGCGAGC ATACGTGTCG ACGGTCCGGG ACAAGGTGCT CGACCTGCTC
GACCGGGTGG CCTTCACCGA CCGGCGGCTG GTTGCGGACG GCTTCGCCTT CGGCATGATC
GTGCAACACG AACAACAGCA CGACGAGACG ATGCTCGCGA CCCACCAGCT GCGATCCGGC
CCGGCCGTGC TCGACGCGCC ACCCCCGCCG GAGCCCCGGG TTCGGGTCGC CGGCGAGGTA
CTGGTTCCGG CCGGCGAGTT CACCATGGGC GCCGACACCG ATCCGTGGGC GTTGGACAAC
GAGCGTCCCG CCCACCAGGT GTACCTGCCG GCGTACGCCA TTGACGCGGC TCCGGTCACC
AACGGTGCGT ACGCGGCGTT CATCGCCGCG GGCGGCTACC ACGACCCGCG GTGGTGGAGC
GCCGCGGGCT GGGCGTATCG GCAGCAGGCG GGCCTGACCG GGCCGTTGCA CTGGCGCCCG
GACGGCGACG GCTGGGCCTA CCACCGCTTC GGCCGGTGGG CGCCGGTACG CGAGGACGAG
CCGGTGGTGC ACGTCAGTTG GTATGAGGCG CAGGCGTACG CCGCCTGGTC AGGTAAGCGG
TTGCCAACTG AGGCGGAGTG GGAGAAGGCA GCCCGCTGGG AACCGGCGAC AGGTCGGTCC
CGCCGCTACC CGTGGGGCGA CGAGGATCCG ACGGTCGACC ATGCCAATCT GGGTCAGCGG
CACCTGTGGC CGGCACCGGT CGGGGCGTAC CCGGCCGGTG CGTCACCGCT CGGCGTCCAC
CACCTGATGG GCGACGTGTG GGAGTGGACC TCGACCACCT TCCGCGGCCA CCCCGGCTTC
GTGGCCTTCC CCTACCGGGA GTATTCCGAG GTCTTCTTCG GCGACGACTA CCGGGTGCTG
CGGGGCGGGT CGTTCGGCAC CGATCGGGCC GCCTGTCGGG GCACCTTCCG CAACTGGGAC
TATCCGATCC GGCGGCAGAT CTTCAGCGGT TTCCGCTGTG CCCGGGACGC CGCACCTGGG
GAGGCACCCG CGTGA
 
Protein sequence
MTETTNRPDG VRLREHIATE LARARARTAV LTDAVDDADL VRQHSPLMSP LVWDLAHVGN 
QEELWLVRDV GGREPVRQDI DDLYDAFKQP RRDRPALPLL PPPEARAYVS TVRDKVLDLL
DRVAFTDRRL VADGFAFGMI VQHEQQHDET MLATHQLRSG PAVLDAPPPP EPRVRVAGEV
LVPAGEFTMG ADTDPWALDN ERPAHQVYLP AYAIDAAPVT NGAYAAFIAA GGYHDPRWWS
AAGWAYRQQA GLTGPLHWRP DGDGWAYHRF GRWAPVREDE PVVHVSWYEA QAYAAWSGKR
LPTEAEWEKA ARWEPATGRS RRYPWGDEDP TVDHANLGQR HLWPAPVGAY PAGASPLGVH
HLMGDVWEWT STTFRGHPGF VAFPYREYSE VFFGDDYRVL RGGSFGTDRA ACRGTFRNWD
YPIRRQIFSG FRCARDAAPG EAPA