Gene Sare_2997 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_2997 
Symbol 
ID5707645 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp3404893 
End bp3406065 
Gene Length1173 bp 
Protein Length390 aa 
Translation table11 
GC content66% 
IMG OID641272444 
Producthypothetical protein 
Protein accessionYP_001537812 
Protein GI159038559 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.929467 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000428453 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGCGTTCAG CGAGAAACCT GCGGCGACTG TCCGCCTCCG GCCTCTCGGC CGTGGCCGTC 
GTCGCGCTAA CCACCGCTCC CGCCGCCGCG CGGCCTGGCG TCGCGCCGAG CGACTCGACA
TGCAGCCTCG TCAGCTCGAC CTGGCAGGCC GAGGCCGCCG TCAACACGGA GCTCACCGAG
CGATTCGACG CCTACGGCAA CTCAGGCGTC GGCTGGACTG GTGGTGACAG CACGTACTCG
GTGCGCCAGC GCGGCGGACG CACGCTGTGG CTCTTCTCGG ACACCTTCCT CGGACCGGTC
AACCCCGACC TGAGCCGACC CACGTCAGTC CCATTCCTCA ACAACTCCTT CGTCGTCGAG
AAGGGTGACA AGCTGACGAC AGTCACCGGC GGGACGTCGG AGCAACCCGA CTCGCTGGTA
CCCCCGGACG AGGCAGACAC GTGGAACTGG CTGGGTGCCG GTGTCGAAAC GCGCGACTCG
CTCGACGTGA TGTTCCTCGA GTTCGGTGTG TTCGGGCCCG GGACGTGGGA CTGGGAATGG
CGCGAGAACA AGCTGGTGCG CTTCGATCCG AAGACCTACG CCGTGCGCGA GGTCGTACCC
ATGCCGTCGT CGGCTGACAT CCAATGGGCG TCGTGGATCG AGCGCCGCGG CCGGGACACG
TATGTGTACG GCGTCGAAGA CCAGGGTGCG ACGAAGTACA TGCACCTCGC CCGAGTCTCG
GGAGACGACC TCACCCAGCC CTGGTCCTAC TGGACCGGCG ACGGCTGGTC GCCCGACGAG
CAGACCTCCG CCCGCATCAT GCCGGGCGTC GCCAACGAGT ACAGCGTCAC GAAGTTCAAG
GACGGCTACC TGCTCGTCAC GCACGACACC AGCGAGCTGT TCAGCCGCAA CGTCGTGGCC
TACATGAGCT GCACGCCGTA CGGTCCCTTC ACCCGGGCCA CCACGCTCTA CCAGACGCCC
GAAACCGGGC AGTCCGGCTC GTACGGCAAC CCGAACATCA TCACCTACAA CGCCCACGAG
CATCCTGACC TGCGCCGCGG CAACCAGTTG CTGCTCACCT ACAACGTCAA CAGCCTCGAC
CCCAACGCTG ACCTCTACGA CGACGTGACG ATCTACCGTC CCCGCTTCGT CAACGTGACG
CTGACGCGCG TTCAGGCAGA GGGGACGCGA TGA
 
Protein sequence
MRSARNLRRL SASGLSAVAV VALTTAPAAA RPGVAPSDST CSLVSSTWQA EAAVNTELTE 
RFDAYGNSGV GWTGGDSTYS VRQRGGRTLW LFSDTFLGPV NPDLSRPTSV PFLNNSFVVE
KGDKLTTVTG GTSEQPDSLV PPDEADTWNW LGAGVETRDS LDVMFLEFGV FGPGTWDWEW
RENKLVRFDP KTYAVREVVP MPSSADIQWA SWIERRGRDT YVYGVEDQGA TKYMHLARVS
GDDLTQPWSY WTGDGWSPDE QTSARIMPGV ANEYSVTKFK DGYLLVTHDT SELFSRNVVA
YMSCTPYGPF TRATTLYQTP ETGQSGSYGN PNIITYNAHE HPDLRRGNQL LLTYNVNSLD
PNADLYDDVT IYRPRFVNVT LTRVQAEGTR