Gene Sare_4871 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_4871 
Symbol 
ID5707563 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp5522739 
End bp5524403 
Gene Length1665 bp 
Protein Length554 aa 
Translation table11 
GC content65% 
IMG OID641274267 
Producthypothetical protein 
Protein accessionYP_001539612 
Protein GI159040359 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.213174 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.122523 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCCTGC GGGATCTGCT GGAATTCCAC GCCGACCCGT GGCATACGTC CGCGCAGGTG 
TGGCATCGGC TCGCTCAGGG CATCGACGAC ACGGCGGAGC AGGTGATCCG CAGTACGCGC
GATGTCGGTG ACGCCTGGCC GGGCGGTGCC GGCTCGGCGG CAGCGGTGCG GAGGGCCACC
GCGTTACGGG CTGAGGTGAG TAACGCGTAC AACCCGGCGA AACGCATCGC GGACGCGATG
GAGCACCACG CGTACGCGAT GTCGGGGCTG CGGCGGCAGG CGGAGGAGAT CGTCGCCTCG
GCGCGGCAGG CTGGCTACCA CGTCGACCTG GTGACCGGGG TGACCACCGC GCCACCCTCG
GCGGACATGG CTGGCGGCCT GGATCGGTCC AGCCGATCGA CCGAATCGGT GCTCCAGTAT
CTGCCGACGG TGGTGGACCA TGCCCGCGCG CAGGACGACG CGACAGCCAA CAGGATCTCC
GTCAATGTGC CGTCGCCGGG GGCTGGCTTC GGGACCGGCC AACTCGACGG CGTTTCGCGC
GCGGTGCTCG AGGCGCAGTC GGAACGGAGC CCGGCCGAGA TCCACGCGTG GTGGGAGTCG
CTGACTCCGC TACAGCAAGA GCAGGTGTTG CGGGAGTTCC CGGAACTGGT CGGCCGGATG
GACGGCATTC CGGTATCCGA CCGGGATGTG GCCAACCGCA GCGTCCTAGA GCGCGAACGC
AGTCTGTTCC AGCAGCAGCT GAGCACGATC GAGGCCAGAG AAGATTTCCT GTGGATAATC
CTCCAGCAGG GTCGTTTTTC AGAGGTCTAC CCGGACGCCG AGGACCCGAG GACTGCATTG
GAAAACGAGT TGAGGAAGCT CGCGTCCGAG CGGGTTGAGC TGCCCGGCAA GCTGCGTGGC
ATCGACGCGA TCACCGCTCG GCTGAATGAC GTCAGCCTCC CTGAGGCATA CCTGATCGGT
TACTCCAGCG ACGGTGACGG CCGAGCGATC GTCTCGGTCG GTGACCCGGA CACCGCCGAC
AACGTGCTCA CCTACGTACC CGGCACCGGC GAGCACCTGT CCAAGGTCGG TGCCGGCCTC
GAGCGTGCCG ACATCATGGC CAGGGACGCA CTCAAAGCGG CCCCGGACGA GAACACCTCG
GTGGTCTACT GGTATGGCTA CGACGCCCCG AACACGATTT TTCCTGATGC CGGCTTGGAC
TCCTATGCCG AGGGCGGCGG CCCGCTCCTC GACACCTTCC AAACCGGACT TCGGGCTACT
CACGACGGCG GCATCCCGTC GCACAACACC GTGCTCGGCC ACAGCTACGG CTCTACCGTG
ATTGGCCATG CCGCTAAGGA AAGTACCTTT AACGCTGACG CTCTGGTGTT CGTCGGCTCT
CCCGGCGTTG ATGTAAACCA CGCCTCTGAG TTAAATGGTG TGCGACCCGG TCAGGTCTGG
GTTACTACAG CGGAGAATGA CATCATACGT CGGGTACCCG ACTGGGATTT TATTCATGGT
AACGACCCCA GTGACCGCGA TTTTGGAGCA CGAGTCTTCG CCAGCGACCC CGGCAACCCT
GACGACGAAG CAGGCACCCA CTCCGCCTAC TGGGACCAGG ACAACATCGC GCGAAAGAAC
ATAGCGCGGA TCGTCACGGA CAGTCCCGTC CGCCTGCCTG AATAG
 
Protein sequence
MTLRDLLEFH ADPWHTSAQV WHRLAQGIDD TAEQVIRSTR DVGDAWPGGA GSAAAVRRAT 
ALRAEVSNAY NPAKRIADAM EHHAYAMSGL RRQAEEIVAS ARQAGYHVDL VTGVTTAPPS
ADMAGGLDRS SRSTESVLQY LPTVVDHARA QDDATANRIS VNVPSPGAGF GTGQLDGVSR
AVLEAQSERS PAEIHAWWES LTPLQQEQVL REFPELVGRM DGIPVSDRDV ANRSVLERER
SLFQQQLSTI EAREDFLWII LQQGRFSEVY PDAEDPRTAL ENELRKLASE RVELPGKLRG
IDAITARLND VSLPEAYLIG YSSDGDGRAI VSVGDPDTAD NVLTYVPGTG EHLSKVGAGL
ERADIMARDA LKAAPDENTS VVYWYGYDAP NTIFPDAGLD SYAEGGGPLL DTFQTGLRAT
HDGGIPSHNT VLGHSYGSTV IGHAAKESTF NADALVFVGS PGVDVNHASE LNGVRPGQVW
VTTAENDIIR RVPDWDFIHG NDPSDRDFGA RVFASDPGNP DDEAGTHSAY WDQDNIARKN
IARIVTDSPV RLPE