Gene Sare_1067 details


Gene Information

Locus tag: Sare_1067
Symbol:
ID: 5705680
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 1194478
End bp: 1195899
Gene Length: 1422 bp
Protein Length: 473 aa
Translation table: 11
GC content: 69%
IMG OID: 641270583
Product: hypothetical protein
Protein accession: YP_001535967
Protein GI: 159036714
COG category:
COG ID:
TIGRFAM ID:
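
The length fields in this record can be cross-checked against the coordinates. The following is a minimal sketch, not part of the original record. It assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based coordinates inclusive on both ends."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa, assuming the last codon is a stop codon (not translated)."""
    return gene_len_bp // 3 - 1


# Values copied from the Gene Information fields of this record.
length_bp = gene_length(1194478, 1195899)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints: 1422 473
```

Both results agree with the Gene Length and Protein Length fields of the record.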


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.119168
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00309173
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGCGCGAAA GCGTTCGACG CACCGTAGTC ACGGCACCAC GGCGCCACCC CCTCGCTGTA 
CTACGCGCCA AGGCCGGCCA CACCCATGGC GAGTACGCCC GCCTGATCGC CGAGACGCAC
GCCACGCTGG GGTTCGGCCA CATGGCGGCA CGTCGAGAAA AGGTCTCCCG TTGGGAAGCC
GGCCGAGCCA TCCCGGAACG GACCGCACAG CTCGCCATCG CCCACATCCA TGGCGTCGCC
CAGGAGCACG TCGACAATCG GGCCTGGCCC GAATGGCTCC ACCTGGCATA CGGCGACGCA
CGTCAGCTGG AGCTCCCCTG GACACCCGCG GCTGCCCCAG AGGCCATCCT CGACGCCGTA
GCGGAGCGAC AACAGGCGCA ACAGGGATAC CTGCTGGCAA CCGGACCGGC AGCCAAGTCA
CTCGCGGAGA ACTGGCAGGA CGCCATGACC GAGGCCCTGA CGAAGGTGCC GCAGCACCTG
CCACGACCCG GCCGGGTGCC CCTCATGTGG CAACGCGGAA CGGATGCCGA GCTGGGATCG
GTACTCCAGG CGTGCACCCG ACTGCGAACG CTACTCACGT TCGCGGGCTG GTTCACCGCC
GGATGGCTGG TGCCCGCCAG CGAGCAGGAA CTGCGGCATG TGGCCAACCA CTTCGCCACC
ACGACAGATG TGATCGCGGA GAAGAGCCGT GGGTTGCTGA CGCTCGCCGC AGAGGGACTG
TCCCTGTGCG GTTTCATCGC CCGCCTCGAG GGGGAACATG TCAGCGCCCA GCGGTACTAC
GTGGCGGGTC TGCGCTGCGC CACGGCCGCC GGCGCGGCGG AGCTCGCCGC GGCGATCATG
ACGATCCACG CTGCCCAGTA CCTGGACCTC GGGCTCCACG AGGAGGCCAC CGAGCTCCTG
ACGTCCGCAC AGACGTTCCT GCGTCGATCC CGGATACCGG TGCGGGATCC GGCGCTGCCC
ACGTTGATGC ACGCGCAGAT CGCCCGGGTG CACGCCCAAC TCGGTGACGA CCTTGGTCGG
CGCCGATCAC TCTCGGCGGG GCGCAACGCG TTGGAAAGCG TGCCGTACGG GGATCCCATG
GCGATCCTGC CGGCTCGCGG CAGCTGCTGG CTGCAGCTGA TGGACGGAGT GTCTCTCCTG
GAACTGGGTC GGCCGGACCA GGCGGTCAAG GCCTTCGATC CGCTGTTCTC CAAACACGTG
CCGGAGCTGA ACCTGCCGCC GTCGGTGCGC TCGCTGTACC TGCTGCGAGC CGCGGAGGCC
CAGGCGGCGG TCGGGGACAC GGTGGGGAGC GTGGAGTCAG TGGCCCAGGC CACGACGCTA
CTCGGCGGTG TTCGGGTGGC GGTCTCCAAA CACGTGCGAC TCGCGCTGCG CGCCTACCAG
CACCTGCCAG AGGTGAAGGC GCTGCTGTCG GCGTCCGACT GA
 
Protein sequence
MRESVRRTVV TAPRRHPLAV LRAKAGHTHG EYARLIAETH ATLGFGHMAA RREKVSRWEA 
GRAIPERTAQ LAIAHIHGVA QEHVDNRAWP EWLHLAYGDA RQLELPWTPA AAPEAILDAV
AERQQAQQGY LLATGPAAKS LAENWQDAMT EALTKVPQHL PRPGRVPLMW QRGTDAELGS
VLQACTRLRT LLTFAGWFTA GWLVPASEQE LRHVANHFAT TTDVIAEKSR GLLTLAAEGL
SLCGFIARLE GEHVSAQRYY VAGLRCATAA GAAELAAAIM TIHAAQYLDL GLHEEATELL
TSAQTFLRRS RIPVRDPALP TLMHAQIARV HAQLGDDLGR RRSLSAGRNA LESVPYGDPM
AILPARGSCW LQLMDGVSLL ELGRPDQAVK AFDPLFSKHV PELNLPPSVR SLYLLRAAEA
QAAVGDTVGS VESVAQATTL LGGVRVAVSK HVRLALRAYQ HLPEVKALLS ASD
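
The sequences in this record are printed in blocks of 10 characters separated by spaces, so they need to be cleaned before any computation. The following is a minimal sketch, not part of the original record, of how the blocks could be joined and how a GC fraction could be recomputed for comparison with the reported GC content. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the demonstration uses only the first line of the gene sequence rather than the full gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in seq (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence in this record, used only as a short demonstration.
first_line = "ATGCGCGAAA GCGTTCGACG CACCGTAGTC ACGGCACCAC GGCGCCACCC CCTCGCTGTA"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```

Passing the full gene sequence through the same two functions gives a value that can be compared with the GC content field. How the database rounds that figure is not stated in the record.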