Gene Sare_4805 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_4805 
Symbol 
ID5708141 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp5435780 
End bp5437216 
Gene Length1437 bp 
Protein Length478 aa 
Translation table11 
GC content71% 
IMG OID641274203 
Producthypothetical protein 
Protein accessionYP_001539548 
Protein GI159040295 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0850035 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.646806 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCTGCTCG ATCCACTGCG AATACCGGAG GATGCGTGGT CGCACGGCGC GGTGCTGACC 
GCCTTGGCCG CCCGCGACAT CGGCGCGCTG TTCCGTCTGA TCGCCAGCCT CACCGGGGCG
AGTCAGAGCC AGATCGGGGC CGCCGTCGGG CTGGAACAGG GCTACGTCAG CCGCATCATG
GCCGGCCGCA AGGTCACCTC AATCGACGTC CTGGAGCGGA TCGCCGACGG CTGCCGGATA
CCCAACCAGG CGCGAATCAC GATGGGCCTG GCGCCCCGAC AGGCCACATC TCCGGCCACC
CCCGGCCGGC TCACCCCCGG TGGGCTCACT CCCGGTCGGC CCAGCGAACC GCCGGCCAAC
CGGACCTGGC AAGACGACGT ACGCAGTGCC GCAGAGCTTT GGCGAGGTGA CGTGAACCGT
CGAGACGTAC TCCGGCAGGT GGCCTTCCAC ACCGCCGGCT ACACCCTGCC CGCGCTGCGC
TGGTTCACCG CCCCCGACCC GGCTCCCGTC ACCCAGCCCG GCCGCAACGC GATTGGCCAA
CCGGAGATCG ACACCATCCG CGCCATGACC GCCACCTACC GCCAGCTGGA CAACCAGTAC
GGCGGCGGGC ACGCCCGCGA CACTGTCGCC CGCTACCTCC ACCAGGAGGT GACCCCGCTA
CTCACCGACG GCCGCTTCGA CCACCCCACT GGTCAACGGC TACTCGGCGC CGCCGCCGAA
CTGGCCCAAC TCGCCGGCTG GCAGGCGTAC GACACCGCCC AACACGGCAT CGCCCAGCGC
TACCTCACCC TCGCCCTGGA CTTCGCCCAC GCCGCCGCAG ACGACGGCCT CGGCGCGGAG
ATCCTCGCCG CGATGAGCCA CCAGGCCACC TACCTCGGTC ACACCACCGC CGGCCTCGAC
CTCGCCCGCG CCGCCGGCCA GACCGCCCAC CGCGCCGGAC TTCCCGCCCT GACCGCCGAA
GCCCACGTCA TGCAAGCCCA CGCCCTGGCC AAAGCCAACG ACGAACGAGC CTGCGCCACC
GCCCTACACA AGGCAGAACA AGCCCTCGAC CGAGCCGACC GCAGCACCGA CCCGCAATGG
CTCAGCTACT TCGACGAGGC ATACCTGTCC GCCAAGTTCG GCCACTGCTT CCACGCCCTC
GGCCGCAACA CCCACGCCGA ACGCTTCGCC GCCCGATCCC TACGCATGAA CGATCGCTTC
GTACGCGGCA AAGCCTTCAA CCTCGCCCTA CTCGCCAACA TCCACGCCCA CCAGGGGCAA
CCCGAACGGG CCTGCAACGT CGGCGCGCAA GCCCTGACCC TGACCACCCA ACTCCGCTCC
ACCCGAGCCG TCCGCTACCT CCGCGACCTG CAAACCCAAC TCGCCCCGCA CCGGCGACTA
CCCGCCGTCC GGCACTTCAC CGGCCGCATC AACGCCACCC TCGGCCCCCG CCGCTGA
 
Protein sequence
MLLDPLRIPE DAWSHGAVLT ALAARDIGAL FRLIASLTGA SQSQIGAAVG LEQGYVSRIM 
AGRKVTSIDV LERIADGCRI PNQARITMGL APRQATSPAT PGRLTPGGLT PGRPSEPPAN
RTWQDDVRSA AELWRGDVNR RDVLRQVAFH TAGYTLPALR WFTAPDPAPV TQPGRNAIGQ
PEIDTIRAMT ATYRQLDNQY GGGHARDTVA RYLHQEVTPL LTDGRFDHPT GQRLLGAAAE
LAQLAGWQAY DTAQHGIAQR YLTLALDFAH AAADDGLGAE ILAAMSHQAT YLGHTTAGLD
LARAAGQTAH RAGLPALTAE AHVMQAHALA KANDERACAT ALHKAEQALD RADRSTDPQW
LSYFDEAYLS AKFGHCFHAL GRNTHAERFA ARSLRMNDRF VRGKAFNLAL LANIHAHQGQ
PERACNVGAQ ALTLTTQLRS TRAVRYLRDL QTQLAPHRRL PAVRHFTGRI NATLGPRR