Gene Sare_1368 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_1368 
Symbol 
ID5707287 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp1582826 
End bp1584184 
Gene Length1359 bp 
Protein Length452 aa 
Translation table11 
GC content71% 
IMG OID641270879 
Producthypothetical protein 
Protein accessionYP_001536260 
Protein GI159037007 
COG category[R] General function prediction only 
COG ID[COG1253] Hemolysins and related proteins containing CBS domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000999119 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGTTGATCG TCGTTGGGCT CCTCTTCATC GTTGTTCTCA CCGCGGCCAC CGGATACTTC 
GTGGCCCAGG AGTTCGGCTA CGTCGCCGTG GACCGGGGCA AGCTCAAGCA GCGTGCCGCC
GATGGCGACA AGGCCTCGTC CCGGGCCCTG GAGGTGACCG GGCGGCTGTC TTTCATGCTC
TCCGGCGCCC AGCTCGGCAT CACGGTCACC GCGCTGCTGG TCGGCTACGT CGCTGAGCCC
TATCTGGGCG CCGGCCTGGC CGACCTGCTC GGTGTGGCCG GGATGTCCGA CGCGGTCGGC
CGGCCGTTGT CCGTCGCGCT GGCCCTGGTC ATCGCCACCA TCGTGCAGAT GGTGCTCGGT
GAGCTGGCAC CCAAGAACCT CGCCATCGCC CGCGCCGAGC CGCTCGCCCG GGCACTCGCC
GCCTCCACCC TGGTCTACCT CAAGGTCGCC GGCCCAGTGA TCAAACTCTT CGACCGGGCC
GCAGTTCGGC TGCTGCGCCG GGCGGGCGTC GAACCCATCG AGGAACTGCC CAGTGGGGCG
ACCCTGGAGG ACCTGGAGCA GATCATCGCC GAGTCCCGCG AGGGCGGGCA CCTGACCGCG
GAGATGTCCA CCCTGCTCGA CCGTGGGCTG GATTTCCGCC AGCTCACCGC GGGGGAGGCC
ATGGTGCCCC GGGTGGACGT GCACACCGTC CGCGCCCACG AGCCGGTCAG CCGTGTCGTC
GAGCTGTTGG AAACCGGCCG TTCCCGGTTT CCCGTCCAGG GCGCCGAGGG CGTGGACGAC
GTGATCGGTG TGACCGGCAT CGCCGACGTG CTCGGCGTGC CACTGGAACG CCGGGCCACC
ACCCCGGTCG GTACGGTGGC GGTCCCCCCG CTGCTGGTGC CCGAGACCCT GCCGCTGCCG
ACGGTGCTGG ACCGGTTGCG CTCCAGCCAC CGGCAGCTCG CCTGCGTGGT GGACGAGTAC
GGCGGCTTCG CCGGGGTGAT CACCCTGGAG GACATCGCCG AGGAGTTGGT GGGGCCGATC
CGGGACGAGG ACGACCCACC GGAGCGAATC CCCACCCGGC AGAGCGACGG CTCCTGGATC
GTGCCGGCCC GCTGGCGGAT CGACGAGGTC GCCGACAGCA CCGGCATCTC GTTGCCCGTG
GCCCCGGAGT ACGACACGCT CTCCGGCCTG GTCATGCGGG AGCTGGGCCG GGTACCCGAG
GTCGGCGACC GGCTGGAGGT CACCGTGCCG GAGGACGCGG ACGAGGCGGT GGCCGGTACC
CGGGTGCTGA TCGAGGTGCT CGGGGTCGAC CGGCACGTCG CCGACTCGGT CCGGCTGTTC
CTGCGTGAGT CGGATGGTGA CCCGGAGGTG CCCTCATGA
 
Protein sequence
MLIVVGLLFI VVLTAATGYF VAQEFGYVAV DRGKLKQRAA DGDKASSRAL EVTGRLSFML 
SGAQLGITVT ALLVGYVAEP YLGAGLADLL GVAGMSDAVG RPLSVALALV IATIVQMVLG
ELAPKNLAIA RAEPLARALA ASTLVYLKVA GPVIKLFDRA AVRLLRRAGV EPIEELPSGA
TLEDLEQIIA ESREGGHLTA EMSTLLDRGL DFRQLTAGEA MVPRVDVHTV RAHEPVSRVV
ELLETGRSRF PVQGAEGVDD VIGVTGIADV LGVPLERRAT TPVGTVAVPP LLVPETLPLP
TVLDRLRSSH RQLACVVDEY GGFAGVITLE DIAEELVGPI RDEDDPPERI PTRQSDGSWI
VPARWRIDEV ADSTGISLPV APEYDTLSGL VMRELGRVPE VGDRLEVTVP EDADEAVAGT
RVLIEVLGVD RHVADSVRLF LRESDGDPEV PS