Gene Sare_2359 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_2359 
Symbol 
ID5704991 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp2714262 
End bp2715761 
Gene Length1500 bp 
Protein Length499 aa 
Translation table11 
GC content68% 
IMG OID641271837 
Producthypothetical protein 
Protein accessionYP_001537208 
Protein GI159037955 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0575455 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCACCG AGGTCGAGTA CGGCATCTCC GTGCCCGGTC AGGCCGGGGC CAATCCGATG 
GTCACCTCCT CCCAGGTGGT CAACGCCTAC GGGGCGCGTC CGGAACTCAA CCGGGGTGGC
CGGGCTCGGT GGGACTACGA GGAGGAGTCG CCGCTGCGCG ACGCGCGTGG TTTCACCTAC
TCCGGGGCCG CGTACGACCC TGCGGAGGCC CTCGCCGACG AGGATCTCGG CCTGGCCAAC
GTGATACTCA CCAACGGAGC GCGGCTCTAC GTTGATCACG CCCATCCGGA GTACTCCACT
CCTGAGGTGA CCACTCCCTG GGATCTGGTT CGGTGGGACA AGGCGGGGGA GTTGGTGATG
GCCGAGGCGG CCCGGCGTGC CGCCACCATC CCGGGTAGCC ACCCCATTCA CCTGTACAAG
AACAACACCG ACAACAAGGG CGCCAGTTAC GGCGCCCACG AGAACTACCT GATGCGGCGG
CAGACCGCCT TCGCCGACAT CGTCACGTAC CTGACGCCCT TCTTCGTCAC CCGGCAGATC
GTCGCCGGCG CCGGGCGAGT GGGCATCGGT CAGGACGGTG GTCAGAGCGG CTTCCAGATC
TCCCAGCGCG CCGACTTCTT CGAGGTCGAG GTCGGGCTGG AGACCACGCT CAAGCGGCCC
ATCATCAACA CCCGCGACGA GCCGCACGCC GACGCCGACA GGTACCGGCG GCTGCACGTC
ATCATCGGTG ACGCCAACCT GTCGGAGATC TCGACGTATC TCAAGCTGGG CACGACCGCC
CTGATCCTCA CCATGATCGA GGAGAAGGCG CTCGTTGCGG ACCTCGGCAT CGCGGATCCG
GTCAGCGAGC TGCGGGCGGT CAGTCACGAC CCGTCACTCG GCCACCTCAT GCGACTGCGG
GACGGGCGGC GACTCACCGC CCTGGACGTG CAGTGGGCCT ACTACGAGCG GGTACGCTCC
TTCGTGGACG ACCGGTACGG CAGCGATGTC GACGAGCAGA CCGCCGACGT GCTGGACCGC
TGGGAGAGCG TGCTGGACCG GCTGGGTCGG GACGCGTTCC TGTGTGCCGA CGAACTCGAC
TGGGTGGCGA AGCTGCGGCT GTTGGAGGGC TACCGGGAAC GGGAGAAGCT CGGCTGGGGG
GCGCACAAGC TGCAACTGGT TGACCTGCAG TACTCCGACG TTCGCCCGGA GAAGGGCCTC
TACCACCGGC TGGTGTCGCG GGGCGCGATG AAGACGCTGC TGCCGGTGGA GGCGACCCAG
GCCGCGATGA CCGAGCCGCC GGAGGACACC CGGGCCTACT TCCGAGGTCG TTGCCTCGCC
CAGTACGCCT CCGAGGTGGT CGCCGCGAGC TGGGACTCGG TCATCTTCGA CGTGGGCCGT
GAGTCGCTGG TGCGGGTGCC GATGATGGAG CCGGAGCGGG GCACCCGCAA GCACGTCGGC
GCGCTCTTCG ACCGGTGCGA GAGTGCCAAG GATCTGCTGG AGACGCTGAC CAATGGTTGA
 
Protein sequence
MGTEVEYGIS VPGQAGANPM VTSSQVVNAY GARPELNRGG RARWDYEEES PLRDARGFTY 
SGAAYDPAEA LADEDLGLAN VILTNGARLY VDHAHPEYST PEVTTPWDLV RWDKAGELVM
AEAARRAATI PGSHPIHLYK NNTDNKGASY GAHENYLMRR QTAFADIVTY LTPFFVTRQI
VAGAGRVGIG QDGGQSGFQI SQRADFFEVE VGLETTLKRP IINTRDEPHA DADRYRRLHV
IIGDANLSEI STYLKLGTTA LILTMIEEKA LVADLGIADP VSELRAVSHD PSLGHLMRLR
DGRRLTALDV QWAYYERVRS FVDDRYGSDV DEQTADVLDR WESVLDRLGR DAFLCADELD
WVAKLRLLEG YREREKLGWG AHKLQLVDLQ YSDVRPEKGL YHRLVSRGAM KTLLPVEATQ
AAMTEPPEDT RAYFRGRCLA QYASEVVAAS WDSVIFDVGR ESLVRVPMME PERGTRKHVG
ALFDRCESAK DLLETLTNG