Gene Sare_1397 details


Gene Information

Locus tag: Sare_1397
Symbol:
ID: 5704083
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 1612919
End bp: 1614637
Gene Length: 1719 bp
Protein Length: 572 aa
Translation table: 11
GC content: 72%
IMG OID: 641270907
Product: hypothetical protein
Protein accession: YP_001536288
Protein GI: 159037035
COG category:
COG ID:
TIGRFAM ID:
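
The coordinate and length fields above are related by simple arithmetic. The sketch below checks that relationship in Python. It assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with inclusive, 1-based coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS of this length, excluding one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1612919, 1614637)
print(length, protein_length(length))  # 1719 572
```

Both values agree with the Gene Length and Protein Length fields listed for Sare_1397.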


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0022466
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGATCGAAG TGTCGGCGCA CCGGGTTGGT CAGGTCCTGA TGGTCGGTGC CGAGGCGGCG 
GTCGCTCGGG CCGCCGCACA CCTCACCCAA CCTCCCGCCG AACCCGGCCG GACCACCGTG
CAGGTTGTCG GCGACGACCT ACGTGACTGG GAACAGCTCG CCCCGACCCT CGCCGCGCAC
ACGTCGGAGG TGGGCACCTC GGTGCGCCTG GTGTCGCCGC CGGCGATGAA CCCGCCATCG
ATAGCGTCCG CGCGTCGGCT GGGCGACCAG CTGGGCATCG AGGTGGTCGC CCCCGACGGG
CCGCTCCTAC CGGCCCGCGA CGGCAGCATG TTCGTGCTCG GCCCCGACGC GTCCTGGTGG
ATCATCGAGG CAGGCGCGGC GGCGCGGCAG GTCGGGCCGC ACCACCCCAC TCCGTGGTGG
GCCGAGCATC GGCCCACCAC GGCCACCGAC TGGGTGACCA TTCCGGCCGG TGGGTGGCTG
CCCGGCGGCG AGCGGCCGGA CGCCGCGGTG CCCGACGACC TCGTTCTCGC GGTACCGCGC
CACGACTCGT TGTTCACGAT GGTGGTGGGT GCGCCCGATC AGCCCCCGGT GGCTCGGGAG
ACGCTGCTGG CCAGCGTGGC TGACCTGCCG GCGCCCGTGC GGGAGCGGCT CCTCGTCGTG
GCCTACGGCC CGGAGCAGGA CGATGCCGCC CCCGCGCTGG CCGCCACGTT CGGCGTCCGC
GTCTACGGTG CCGACGGACT GCCCGGGTAC GGCCCGGCCG GCGAGATCGT TGTCCGAGCC
GTGGCCGCGG ATGGCCGGCG TGGGCCGCGC CAATGTGCTC GGACCTTCGC TCAGGCCCCG
GAAGAGGCCG AGGGGCGACC AGCATTGCGC GACGAACCAA GCGCTTCGCC GGACGAATGG
CCGGCGGGGA CCTCCGCCAC GGACATTGCG GCGGAGGCGC CGTTGCCGCC GGTCCGGGTC
CGGCCGGACC AGCGCAGCGC CGCACCGGAA CGCCATCGGC TGCGTGGGGC GCTCGGGGCG
TACTACGACC TCCACGCGCG GGTGGTGGCG CGACTGTTGG CTCAGCACCC GGGGCTGCGC
GTGCTCCCCG CCGGGGATGA GCCCCATGCG TTGATGACGG ACCTGGTCGC GGTGCGGGCG
TTCCTCATGG GTGACCGCTC GTCCGTGACG GCCGCACTGC GTTCGATGTC GGACGTGGGT
GACCCGGCCT TCCTCATTTG TCTGGCATCC GGCCTCAGGC GGCTACCCAG CTATCACGGG
GTGGTGTATT CCTCTGTGCC GACAGAGCAC GCATCGCGCG TCTATCTGGA CGGCCGATCC
ATCTGGGAGC CGACATTTCT GGAGGCGTCC ACAACTCGGG TTGCCGCGGG TGCCGAGATG
ACGGATCTCA TCGTGTGGTC CACCAACGGT CGGCACGTCG GCGGAATCGT TGGTGGGGGA
GACCACCACC GCGTGGTGTT CCCCGCTCGG TCCCGCTTTG TCGTGCTCGG CCACCGGCCG
GCGGGGAGAG ACTGCTCCGC TGCGGTGTTC CTCCGTGACG TCCCTGCTGA GCCCGGGCAG
ACCGAGGGGA CGACAAACCG CCGTATCCAC AAGCGACTCG AGGCGTTGAC GGCGGCTGGC
GCGCGCGTGG GACGTCAGGC CGCGTCCGAT CCTGGCTGGG CGGTCAGTGG TGAGCTGCCG
GGTTGTGACG AGACAGGGCG ACCGTATCGG TCGGAGTAA
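
The gene sequence above is printed in space-separated groups of 10 bases, so the whitespace has to be removed before any computation. The sketch below shows one way to do that and to compute a GC percentage in Python. The helper names are hypothetical, and the example runs only on the first line of the sequence, so its result is not expected to match the whole-gene GC content field.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a grouped sequence listing."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence only; paste the full listing to check the whole gene.
first_line = "ATGATCGAAG TGTCGGCGCA CCGGGTTGGT CAGGTCCTGA TGGTCGGTGC CGAGGCGGCG"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```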
 
Protein sequence
MIEVSAHRVG QVLMVGAEAA VARAAAHLTQ PPAEPGRTTV QVVGDDLRDW EQLAPTLAAH 
TSEVGTSVRL VSPPAMNPPS IASARRLGDQ LGIEVVAPDG PLLPARDGSM FVLGPDASWW
IIEAGAAARQ VGPHHPTPWW AEHRPTTATD WVTIPAGGWL PGGERPDAAV PDDLVLAVPR
HDSLFTMVVG APDQPPVARE TLLASVADLP APVRERLLVV AYGPEQDDAA PALAATFGVR
VYGADGLPGY GPAGEIVVRA VAADGRRGPR QCARTFAQAP EEAEGRPALR DEPSASPDEW
PAGTSATDIA AEAPLPPVRV RPDQRSAAPE RHRLRGALGA YYDLHARVVA RLLAQHPGLR
VLPAGDEPHA LMTDLVAVRA FLMGDRSSVT AALRSMSDVG DPAFLICLAS GLRRLPSYHG
VVYSSVPTEH ASRVYLDGRS IWEPTFLEAS TTRVAAGAEM TDLIVWSTNG RHVGGIVGGG
DHHRVVFPAR SRFVVLGHRP AGRDCSAAVF LRDVPAEPGQ TEGTTNRRIH KRLEALTAAG
ARVGRQAASD PGWAVSGELP GCDETGRPYR SE
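
The protein sequence can be reproduced from the gene sequence by codon-by-codon translation. The sketch below is a minimal Python version. It relies on translation table 11 assigning the same amino acids to codons as the standard code (the two tables differ in which start codons are permitted), and it assumes the input is already in frame. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon table in the conventional T, C, A, G base order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame CDS, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop the 10-base grouping and line breaks
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate("ATGATCGAAG TGTCGGCGCA CCGGGTTGGT"))  # MIEVSAHRVG
```

The output matches the first ten residues of the protein sequence listed above.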