Gene Sare_2591 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_2591 
Symbol 
ID5707176 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp2953618 
End bp2955798 
Gene Length2181 bp 
Protein Length726 aa 
Translation table11 
GC content74% 
IMG OID641272053 
Producthypothetical protein 
Protein accessionYP_001537423 
Protein GI159038170 
COG category[S] Function unknown 
COG ID[COG1944] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.245728 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.176917 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCACTG CATACGACAC GGTGGCGCAG ACCCGGCCGC GAGTGCGCCA CGACGTGCTG 
TTCACCCGAA CCGAGGACGG AGTGCTGTTC CACAACGCCA CCAGCGGCTT CCGGTTCTCC
TCCACCACCG CGTACCGGCT GGCGTCGGTA CTCGTCCCGC ACCTGAACGG GCGCAATCAG
GTCGCGGACA TCTGTGCCCG GCTACCCGCC GGGCAACGGG CCATGATCGG TGAGCTGGTG
AGCACCCTCT ACGCACGCGG CTTCGCCCGC GACGTCCCCG AGACGGAGGG AGACCCGACG
GCGATCCTCG GCCCGGCGGT GGCCGCGCAC TTCGCCACCC AGGTTGCCTA CCTCGACCAC
TACACCGACC GGGCGCCGCA GCGCTTCGCC ACCTTCCGGC ACACCTCGGT CGCGGTGCTG
GGCGCCGGCC CGGTCGCCAC CGCCTGCGCC ACCGGACTGC TTCGCAACGG CGCCGCCACG
GTGACCGTCT CGCCGGCGAT CGCGCCGAGG CTCGCGCCCG AGCTGGCCGA GCTCGACGCC
GCAGGCTGCC CCGCGACGAC CGTGCCGCTG CCCACGACCG GCAACGAGGT CGGCTGGTCC
GACCTGGCCA CAGCGCAGAT CGTCGTGGTG GCCGGCGGGG ACGACGCGCC CCGCGACACC
CTGCGGCTCC TCGCGGCCGG CGTTCCCGCC GACCGGCTGC TCCTGCCGGC CTGGGTCGCC
GGCGGACGGA TGCTCGTCGG GCCGGTGCAG GGCGAAGGCC GTACCGGATG CTGGTGCTGC
GTCATGCGGC GACTCGCCGA CAACGACGAG ACCGGTGGCG CCGGGCAGGT GTGGCAGGCG
GCGGCGCTCC CGTCCGGGGC CGCGCCAGCC GCCACCGAGC CCGACGGACC GCTTGCGGCG
ATGATCGGCA ACCTGTTGGC GTACGAGGTG TTCCGGCTGA CCACCGGCGC GCTGCCCGCC
GAGACCGACG GGAGCGTGAT CGTTCAGCAC CTCGCCTCAC TCGACGTGCT GACCGAGCAA
CTTCTTGCGC ACCCCCGGTG CACCTTCTGC CGGCCGGCAC CGCCCGAACC GGCCTGGACC
ACCGAAGGGC TGGACGAAGC GCCTGCCGAG GCCGCCTCCG CGGCCGACCC GGCCGCGGGG
GCGCAGGAAG CGCTCGCACA GTTGGAGTCC CACCAACCGC TACTCCAGCC GCATCTGGGC
GTGTTCCGTC GCTACGACGA CGAGCGTTGG GACCAGACCC CGATCAAGGT GGGCGCGGTC
GAGCTCACCG ACGGCAGCGG CCGGCGCCGA ACGGTGACCG CGTTCGACGT CCACCACGTC
GCGGCGGCCC GGCTGCGGGC ACTACGGATC GCGGCCGTCG TCAACACCAG CAGCATCGCG
GTCGGGACCC CCGCCCCCCA GGGCGCCGAG CGGGTCGACG CCGCCCGACT CGGCCTCGCC
TCAGGCTGGG GCGACGCCCC GGTGCAACGG TGGGCCACCG CCCGGTCACT GCTCAGCCGC
GAGGTGGTGG CGGTGCCGAT GCCCGTGCTG GAGCCGTTCG GCGCGGCCAA CCGCCGGCAC
GAGGCCGAGC CGACCAGCGC CGGTGGGGGA GCGGGCGGCG ACCTCACCGA GGCCGTCCGG
GCCGGCCTCG CCTCGGCGCT CGCCGGGCAC ACGCTACGGC AGGTCATCGC CGGTCGGGAC
ACCGTCCGAC GGATCCGCCT CGACACCATC GGCACCACCC CCGAGCTGGT GTTTCTCACC
AGGTCAACAG CGAACCTGGG CGTCACCATG GAACTGCTCG ACCTCGGTGG ACAGCGCGAC
ACCGGGGCGG CGGTGCTGCT GGCCCGCTCC TTCGACCCGG ACCGCGGTCA ATGGACGTTC
GCGCTGACCG CCGATCCGGA CTGGACGACG GCGGCCGCGG CGGCGCTGCG CGACGTGCTC
GGTCAGGCCC AGCTTCGCGC GCAGGACCCC GAGCTCGTAC CAGATATCGG CGACCCGGTG
CTGGTCGACT TCGACCCCGG CACGGTGCCG GTGCACGACG AGGTGGACGC GGCCGGGGTG
CGGCACCGCT GGGCCGACGT GCTGCAGCGG CTACCGGGAC TCGGGTACGA CGTGCTGGTG
GCGCCCGTCG GTGGTGCGGA CCTGGCCGCG GGCGGGCTGG TGGCGGTCAA AGTGCTGCTC
GCCGCCGGAG ACCACCGATG A
 
Protein sequence
MSTAYDTVAQ TRPRVRHDVL FTRTEDGVLF HNATSGFRFS STTAYRLASV LVPHLNGRNQ 
VADICARLPA GQRAMIGELV STLYARGFAR DVPETEGDPT AILGPAVAAH FATQVAYLDH
YTDRAPQRFA TFRHTSVAVL GAGPVATACA TGLLRNGAAT VTVSPAIAPR LAPELAELDA
AGCPATTVPL PTTGNEVGWS DLATAQIVVV AGGDDAPRDT LRLLAAGVPA DRLLLPAWVA
GGRMLVGPVQ GEGRTGCWCC VMRRLADNDE TGGAGQVWQA AALPSGAAPA ATEPDGPLAA
MIGNLLAYEV FRLTTGALPA ETDGSVIVQH LASLDVLTEQ LLAHPRCTFC RPAPPEPAWT
TEGLDEAPAE AASAADPAAG AQEALAQLES HQPLLQPHLG VFRRYDDERW DQTPIKVGAV
ELTDGSGRRR TVTAFDVHHV AAARLRALRI AAVVNTSSIA VGTPAPQGAE RVDAARLGLA
SGWGDAPVQR WATARSLLSR EVVAVPMPVL EPFGAANRRH EAEPTSAGGG AGGDLTEAVR
AGLASALAGH TLRQVIAGRD TVRRIRLDTI GTTPELVFLT RSTANLGVTM ELLDLGGQRD
TGAAVLLARS FDPDRGQWTF ALTADPDWTT AAAAALRDVL GQAQLRAQDP ELVPDIGDPV
LVDFDPGTVP VHDEVDAAGV RHRWADVLQR LPGLGYDVLV APVGGADLAA GGLVAVKVLL
AAGDHR