Gene Sare_3334 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_3334 
Symbol 
ID5708289 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp3845329 
End bp3846903 
Gene Length1575 bp 
Protein Length524 aa 
Translation table11 
GC content60% 
IMG OID641272761 
Producthypothetical protein 
Protein accessionYP_001538128 
Protein GI159038875 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.176126 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000765121 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGATCGGTC AGCGCGGTGA GCAGCGGTGC CGCCGTTGCA GAACACGGCT GGCGCGCGAC 
AACAGAACCG GACACTGCGC GCCGTGCCAG CTAGCCGGCC GCGACCGGTT CGCACATCCA
CTCAGCGTGC CAGCCGAGTT CTGGGATCAT CCGGCGATTA GGGAGGCGAT AGCCGCACGT
CACATGGGAC GCCTCATAAG AGCGTACCGC TGCCATCCAA TTCACGGACT ACATCCGCTT
CCACAAGCGG TCGTGGCGGG CTGGCTCGGC GTCACCCAAG CGCAACTAAG CCGAATCGAA
AAACACTCAC CGGTCGTCCA CCTCGACCGC TTGATACTCT GCGCGAGAGC ACTTCGAATC
CCTGCGGACA GGCTATGGTT CGCTCTTCCC GAAAGCAACG GCGCATTCGG TGAAAAGGCA
CCGGCGGCGA ACGGTATAAT ATCGGATGGC GAATTCAGCC CGCCCCTACT TTGGGCGTCG
ACAAATACGG CTGAAATCGT TAGCCAATTT ACGAGAAGGG ATCTTACCGT GGACCGACGC
GAGGCAGCGA AAAATGTTGT CGGCGTCGTG TTCGGCGCGG CACTTCTTGA ACCTATGGAG
CGCTGGCTCG GTGATCCCGC ATCTGATCAT GGCGACGGTC GACCGAGTGG TGTGGGATAT
CAAGAGGTTG GCCAGATTGA ACTTGTGGCA CGAATGTTCC GGGAATGGGA CGATCAGTTC
GGGGGCGGAT TGCGGCGGAA AGCGGTTATC GGCCAGCTGA ACGAAGTTTC CGAACTTCTC
CGGGACTCCC ATCCAGCCGA AATCCGTCGC CGACTGTTCG GCACGGTAGC CCACCTCGCC
GAAACTGCGG CCGTCATGTC CTGGGATTCT GGACAGCAGG CACTCGCACA ACGGTACTAC
ATCCTTGCCC TGCATGCAGC GAAACCGGCC GGCGATTTCG CTTTCGCGGC GAACATTATG
GCCGGCATGG CTCGACAGCT TCTCTATCTC GGCCAGACAG GCGACGCCCT TGAGCTGATA
AGAGTCGCTC AGGACAGCGC CAAAGATGCG ACGTCAACCG TTCGGTCCAT GCTCTACACA
CGCGAGGCAT GGGCCTACTC AAAGCAAGGG CGCATCTCCG CCTTTCGACG TGCGACCGAT
AATGCCCAAG AAATGTTCGC TGCCGCTACG CCGGATGAAG ACCCGTACTG GATCACTTAC
TTCGATGCGG CTGAGTTGGC CGGCACAACC GGCGGCCGGT TCCTTGATTT GGCTCATACC
AACCGAGAGA TGGCGGACGA GGCTGCAGCC GAAATTGAGA GCGCGATCGA CTTGCGCCGT
CCGGGGCGTC TCCGAAGTTC CGCGTTGGAC CATATCGGAC TTGCGGAAGC GCGATTGATT
CAGGGCGAAT TGGACGAAGC GGTAAGGCTA GGGCACAGTG CCGCCGATGT TGTCGAGCAG
ACTTGTTCTG ACCGGGCTCG CGTAAAATTC GCCGAATTCC ACCAACACGT AGCCACCTTC
GCCGAAGTGG CGGCTGTCGC GGAACTGCGA GAGCGAATCG GCACCCTGCT GGCCAAGCCT
CCGACGACAC TATGA
 
Protein sequence
MIGQRGEQRC RRCRTRLARD NRTGHCAPCQ LAGRDRFAHP LSVPAEFWDH PAIREAIAAR 
HMGRLIRAYR CHPIHGLHPL PQAVVAGWLG VTQAQLSRIE KHSPVVHLDR LILCARALRI
PADRLWFALP ESNGAFGEKA PAANGIISDG EFSPPLLWAS TNTAEIVSQF TRRDLTVDRR
EAAKNVVGVV FGAALLEPME RWLGDPASDH GDGRPSGVGY QEVGQIELVA RMFREWDDQF
GGGLRRKAVI GQLNEVSELL RDSHPAEIRR RLFGTVAHLA ETAAVMSWDS GQQALAQRYY
ILALHAAKPA GDFAFAANIM AGMARQLLYL GQTGDALELI RVAQDSAKDA TSTVRSMLYT
REAWAYSKQG RISAFRRATD NAQEMFAAAT PDEDPYWITY FDAAELAGTT GGRFLDLAHT
NREMADEAAA EIESAIDLRR PGRLRSSALD HIGLAEARLI QGELDEAVRL GHSAADVVEQ
TCSDRARVKF AEFHQHVATF AEVAAVAELR ERIGTLLAKP PTTL