Gene Sare_0216 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_0216 
Symbol 
ID5706120 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp246774 
End bp248444 
Gene Length1671 bp 
Protein Length556 aa 
Translation table11 
GC content70% 
IMG OID641269745 
Producthypothetical protein 
Protein accessionYP_001535142 
Protein GI159035889 
COG category[S] Function unknown 
COG ID[COG4805] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0166769 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0154007 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCGACGAA TCGATGATAT TGCGAACCGG TACGTGGCGG ACTGGGCCGC ACTGAACCCA 
ACCGGTGCCA CCTACGTCGG CATCCCGGGC CACGACGACC GGCTCGACGA TCTCTCGCCG
GACGGGTACG CCAGCCGCAC TGACCTGACC CGCCGAGTCC TCGCCGACGT CGACGCCACA
GAGCCGACGT CACCGGCGGA GCACACCGCG AAGGAGGCCA TGCAGGAGCG ACTCGGCCTC
GAGCTCGCCC GCTACGAGGC GGGAGAGGTG GGTCGCGGAG TCAGTGTCAT CACCAGCGGT
CTACACGAAC TCCGGTCCGT GTTCGACCTG ATGCCGACCG GCGGCGAGGG CGACCGGGCC
AACATCGCCG CGCGGCTCAA CCGCTTCGCC GAAGCACTCG AGGGATACAA GACCACGTTG
CGCGAGGCGA CCGACGCCGG CCAGGCCAGC GCCCAGGCGC AGCTACTCGA GGTGGCCAGG
CAGTGTGACG TCTGGGTGGA CCCGGACGGC GACAACTTCT TCCACGGGCT GGTCGAGCGG
CTGGACGAGG GGGGCACTCT CGGCGCCGAG CTGCGCCGGG GTGCCACGGC CGCCACCGCG
GCGACCGCCG AGTTCGGCCG ATTCCTCCGT ACCGAACTGG CCCCACGGGG TAGGACGAAC
CAGGCCGCCG GCCGGGAGCG CTACGAGCTG GCCTCGCAGT ATTTTCTCGG CGCGCGGGTC
GATCTCGACG AGACGTACGC CTGGGGGTTC GAGGAGCTGG CCCGGCTCGA GGCGGACATG
CGAACGGTGG CCGCGCGGAT CGTCGGTCCC GGCGCCACGG TCGACGAGGC GGTAGCCGCG
CTGGACGCGG ATCCGGCGCG GACCATCCAG GGTAAGGAGG CGTTCCGGGA CTGGATGCAG
GGCCTCGCGG ACAGGGCGAT CAGCGAGCTG CACGGCACCC ACTTCGACAT TCCGGAGCAG
GTCCACCGGA TCGAGTGTTG CCTGGCGCCG ACGAGTGACG GCGCGATCTA CTACACCGGT
CCGAGTGAGG ACTTCTCCCG CCCCGGCCGC ATGTGGTGGG CAGTGCCGCA GGGCATCAAC
GACTTCTCCA CCTGGCGCGA GGTCACCACC GTCTACCACG AGGGTGTACC CGGCCACCAC
CTTCAGGTCG CCCAGACCGC GGTCCGGGCG GAGACCCTGA ACCGCTGGCA ACGGTTGCTC
TGCTGGGTCT CCGGGCACGG TGAGGGCTGG GCCCTCTACG CCGAGCGGCT GATGGAGGAA
CTGGGTTACC TGGAGGACGC GGGCGAACGG CTGGGCATGC TCGACGGCCA GGCGCTGCGC
GCCGCCCGCG TGATCGTCGA CATTGGCATG CACCTGGAGT TGGAGATCCC GACCGACAAC
CCGTTCGGCT TCCACCCGGG CGAGCGCTGG ACACCGGAAC TGGGCTGGGA GTTCATGCGG
GCGCACTGTC GGATACCGGA TGAGGTCCTG CGCTTCGAGC TGAACCGCTA CCTGGGTTGG
CCCGGGCAGG CGCCGTCCTA CAAGGTTGGT GAGCGGATCT GGCTGCAGGC CCGGGCCGAC
GCGAAGGCCC GCAAGGGTGC CGACTTCGAC CTCCGGGAGT TCCACCGGCA GGCACTCGAC
CTGGGCTCAC TCGGCCTGGA CCCGCTGCGT CGGGCACTCG CCCGAATCTG A
 
Protein sequence
MRRIDDIANR YVADWAALNP TGATYVGIPG HDDRLDDLSP DGYASRTDLT RRVLADVDAT 
EPTSPAEHTA KEAMQERLGL ELARYEAGEV GRGVSVITSG LHELRSVFDL MPTGGEGDRA
NIAARLNRFA EALEGYKTTL REATDAGQAS AQAQLLEVAR QCDVWVDPDG DNFFHGLVER
LDEGGTLGAE LRRGATAATA ATAEFGRFLR TELAPRGRTN QAAGRERYEL ASQYFLGARV
DLDETYAWGF EELARLEADM RTVAARIVGP GATVDEAVAA LDADPARTIQ GKEAFRDWMQ
GLADRAISEL HGTHFDIPEQ VHRIECCLAP TSDGAIYYTG PSEDFSRPGR MWWAVPQGIN
DFSTWREVTT VYHEGVPGHH LQVAQTAVRA ETLNRWQRLL CWVSGHGEGW ALYAERLMEE
LGYLEDAGER LGMLDGQALR AARVIVDIGM HLELEIPTDN PFGFHPGERW TPELGWEFMR
AHCRIPDEVL RFELNRYLGW PGQAPSYKVG ERIWLQARAD AKARKGADFD LREFHRQALD
LGSLGLDPLR RALARI