Gene Sare_2290 details


Gene Information

Locus tag: Sare_2290
Symbol:
ID: 5705880
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 2629748
End bp: 2631226
Gene Length: 1479 bp
Protein Length: 492 aa
Translation table: 11
GC content: 68%
IMG OID: 641271768
Product: hypothetical protein
Protein accession: YP_001537139
Protein GI: 159037886
COG category:
COG ID:
TIGRFAM ID:
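
The start and end coordinates, the gene length, and the protein length listed above are mutually consistent, and the arithmetic can be sketched in a few lines of Python. This is a minimal illustration that assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2629748, 2631226)
print(length)                  # 1479
print(protein_length(length))  # 492
```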


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.134356
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0290276
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGCGAGG AAGTCGACGT GCGGACCTTC AACCGTGCGG ATCGGGCCCG ATACCGCGAG 
AAGGTTCGCC GCTGCCTCGA CGTCTTCGCC GAGATGCTGC GGGAGTCCCG ATTCGACGTC
GACCGGCCGA CGAGCGGGCT GGAGATCGAG CTCAACCTGA TCGACGATGA GTCGCTGCCA
GCGATGCGCA ACACCGATGT GCTGGCGGCG GTGGCCGACC CAGACTTCCA GACTGAGCTG
GGCCAGTTCA ACCTGGAGAT CAACGTGACT CCTCGGCGGC TCGCCGGCAC CGGGGCGTCG
GAATTCGAGC GGAAGGTACG CGACAGCCTC AACGCGGCCG AGGCCAGGGC ACGCACCGTC
GGCGCCCATC TGGTCATGAT CGGGATCTTG CCGACGCTAC GGCCGGAACA CCTGACCTCG
GCCACGCTCT CGGCGAATCC TCGCTTCGAA CTGCTCAACG AGCAGATCTT CGCCGCTCGC
GGCGAAGATC TGCGAATCAC CATCAATGGT GTGGAGCGGC TGGCGGTCAC AGCCGACACC
ATCACCCCGG AAGCGGCCTG CACCAGCACC CAGTTCCACC TCCAGGTCAG CCCTGCCCAG
TTCGCCGACT ACTGGAACGC CGCGCAGGCG ATCGCCGGTA TCCAGGTGGC GCTGGGCGCG
AACGCACCAC TGTTCTTCGG ACGGGAGCTG TGGCGGGAGA CGCGGATTCC CCTGTTCGAA
CAGGCGACCG ACACTCGAGC GGAGGAGATC AAGGCGCAGG GCGTCCGCCC CAGGGTGTGG
TTCGGCGAAC GCTGGATCAC CACGGTCTTC GACCTGTTCG AGGAGAACGT CCGCTACTTC
CCCGCGTTAC TGCCCGTCTG CGATCCGGAG GACCCCGCCG AGGCGATCGC CGCCGGCGCC
GTGCCGAGGC TCGCCGAGCT GCGCCTGCAC AACGGCACCA TCTACCGGTG GAACCGGCCG
GTCTATGACG TGCTGAACGG ACGGCCGCAC CTCCAGCTGG AGAACCGGGT GCTACCGGCC
GGGCCCACGG TCGTGGACAC CGTCGCGAAC GGGGCCTTCT TCTTCGGTCT CGTCCGCGCA
CTGGCGGAGA GTGACCGCCC ACTGTGGTCG CAGATGTCGT TCAGCGCCGC CGAGGAGAAC
TTCACCACCT GCGCCCGGTA CGGCATCGAC GCGCAGATCT TCTGGCCGGG CCTCGGCTAC
CTGCCGGTCA CCGAGCTGGT GCTACGCCGG TTGCTGCCCC TGGCCTACCA CGGGCTGAAC
CGCTGGGGAG TGGACCCGAA CGAGCGCGAT CGCCTGCTGG GCATCATCGA GCAGCGTTGC
CTCACCGGTC GCAACGGCGC CAGTTGGCAG GTGGCCACAC TGCACCGGCT TGAGTCCGCC
GACCATCTGG GGCGACCCGA GGCACTACGC GAGGTGGTCC GCCACTACGT GGATCTCATG
CACAGCAACC GACCGGCCCA CGAGTGGCCC CTGCCCTGA
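
A GC content figure such as the 68% reported for this gene is the fraction of bases that are G or C. The sketch below shows one way to compute it from a sequence laid out in whitespace-separated blocks, as in the listing above. The helper name `gc_fraction` is hypothetical, and the example runs only on the first 10-base block of the gene, not on the full sequence.

```python
def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases, ignoring whitespace and letter case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First block of the gene sequence: 6 G and 1 C out of 10 bases.
print(gc_fraction("ATGGGCGAGG"))  # 0.7
```

To apply this to the whole gene, the full listing can be passed as one string. The blocks and line breaks are stripped before counting.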
 
Protein sequence
MGEEVDVRTF NRADRARYRE KVRRCLDVFA EMLRESRFDV DRPTSGLEIE LNLIDDESLP 
AMRNTDVLAA VADPDFQTEL GQFNLEINVT PRRLAGTGAS EFERKVRDSL NAAEARARTV
GAHLVMIGIL PTLRPEHLTS ATLSANPRFE LLNEQIFAAR GEDLRITING VERLAVTADT
ITPEAACTST QFHLQVSPAQ FADYWNAAQA IAGIQVALGA NAPLFFGREL WRETRIPLFE
QATDTRAEEI KAQGVRPRVW FGERWITTVF DLFEENVRYF PALLPVCDPE DPAEAIAAGA
VPRLAELRLH NGTIYRWNRP VYDVLNGRPH LQLENRVLPA GPTVVDTVAN GAFFFGLVRA
LAESDRPLWS QMSFSAAEEN FTTCARYGID AQIFWPGLGY LPVTELVLRR LLPLAYHGLN
RWGVDPNERD RLLGIIEQRC LTGRNGASWQ VATLHRLESA DHLGRPEALR EVVRHYVDLM
HSNRPAHEWP LP