Gene Sare_2554 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_2554 
Symbol 
ID5706408 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp2907564 
End bp2910359 
Gene Length2796 bp 
Protein Length931 aa 
Translation table11 
GC content68% 
IMG OID641272017 
Producthypothetical protein 
Protein accessionYP_001537387 
Protein GI159038134 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.521814 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00000671119 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
GTGAAGCGCA GACGTCGCCA TCTACTCGCA GGACTAGCGG TCGCTGCACT CGTAGCGACG 
ACGGGGGTCG CCGGCATCCA CCTCGCCGGC ACCGAAGTGC CCGCCGCCGC GGCAGCGGCC
GGGGACGGCG AACTCCCGAA CGCCCTCGGG GCCCACCTGG AACGGCTGCG TCGGTCGGTG
CCGGGCAACG ACGGCATGTC GCCGGACGGT CCGGGTGGCG CCGCGGAGCA GGAGTTCCTG
AAGCGGGCGT ACCCGGCCAG CGACGTCAGC ATCGCCGAGA CCGACCGCTC CAAGGCGGCG
TACGCGGCGG CGGAGCGCCG GTTCCGGGGC GGTCAACGGT GGACCAACGT CGGGCCGAGT
GAGGCGCTGT ACCCGTTCAC CGAGTACCGC AACTCCTTCA GCTACGTACC CAACGAGTAC
GTCGCCGGCG GCCGGGTGAC CTCTCTCGAC ATCAGCCCCG GCTGCAACCG GTGGCTCTGC
CGGGCCTACG CCACCCCGTC CGGCGGTGGG GTCTGGGGCA CCCTCAACAT CCTCGCCGCC
GAGCCGAGGT GGTTCTACCT GGGTGGCCCG CTCGGCATCA ACGCTGCCGG CTCGGTCAAG
ATCGACCGTA ACGACCGGAC CGGGCTCACC ATCTACGTCG GCACCGGTGA GGCGAACATC
TGTGGCTCTG GCTGTGTCGC CGGCGTCGGC CTCTACCGGT CCACCAACGG CGGCCTGACC
TGGCAGGGCC CGCTCGGCAA GGATGTACTC GCCGGCAAGG GCATCGCCGA GATCACCATC
GAGCCGGACA ACCCGAAGAC CATGTACGTC GCCACGACCA CCGCGCTGCG CGGCATGTCC
AGCTCCTGCT GCACGGGCGT GACCCGGCCG GTGCCCGACG CGGAGAAGTG GGGCCTCTAC
AAGACCACCG ACGGCGGCAA GAACTGGAAC TTCATCCACA ACGGCTCGGC CGCCGCCACC
GACTGCACCG GCGACGCCGC GGAGTTCGCC AACAGTGGGG TCTGTTCGCC ACGCGGCGTC
CGCTACGTCA AGCTCGACCC GAAGGACTCC GACGTCGTGT ACGCGTCGTC GTACGGCCGC
GGCATCTGGC GCTCTCCCGA CGGTGGAGCC ACCTGGGCGC AGATCAAGCC GTCGCTCAAC
CCGGACCTGT TCCAGACCCG GGCCGCCATC GACGTCACCG CCCTGCCCAA CGGCAAGACC
CGGATGTACG TCTACGAGGG TAACTTCGGT AACCCGTACT CGCGCCTGCT GCGCAGCGAC
GACGTGGCCA GCGGAACCCC CACCTTCACC GACCTGACCA GCTCCAACCC GGCTGATCCG
GGCTTCGCCA CCTACAACCA GTGCACCGGC CAGTGCTGGT ACGACATGTT CGTGCACACC
CCGCCCGGTC ATCCGGACAT CGTCTACACC GGTGGCTCCT ACTCCTACGG TGAGACCGTC
GCGCACAAGC GGGCCGTCAT CCTCTCCACC GACGCCGGGG TCAGCGGCAC CGACATGACC
TTCGACGGCA CCGACGAACT GCACCCGAAC GGTCTGCACC CGGACCAGCA CGCGATCGTC
ACCAACCCAC GTGACCCGTA CCAGTTCTTC GAGGCCAACG ACGGCGGGAT CATGCGGTCC
AGCGGTGAGT TCGTCGATCG GTCCGCCTGG TGTGACGACC CGAACCGCAA CCTGACGACG
CAGGCCCAGC AGGACCGCTG CAAGCAGATG CTCTCCCGGA TCCCCTCCAA ACTGGAGGGT
GTCAACAAGG GCATGAACAC GTTGCAGTTC ATCAGTCTGT CGGTCAGCCC GCACGACGTG
AACCTGCTGC AGGGGGGCAC GCAGGACAAC GGCACCTGGG AGAACAAGGG TGAGCGGCGG
CGCTGGGTGA ACACGATGAT CGGTGACGGA GGCGCGTCCG GTTTCGACGT CGGCAAGCCC
GAGTTCCGCT TCCACACCTT CTTCAACGCG ACACCCGAGG TGAACTTCAA CAGCGGCGAC
ATCGCCGACT GGATCTGGAC GGCGGACCCG ATCTTCGGCC ACGCGGGCAC CCTGTTCTAC
GCCCCGGTCA TCAGCGACCC GAAGGTCAGC GGCACGATGT TCGCCGGCAC CGGCCGCACG
GTCTACCGGA CGAAGACCTT CGGTCTGGGC GACCGGAGCA TCGAGGAGGC TAACCGGATC
TGCAACACCT GGACCGGTAC GTTCGAGGAG CAGTGCGGGG ACTGGGCGGA ACTGGGCGCC
ACACCGCTCA CCGACGCGGC ATGGGGGGAC CGGGCCGGCG GCGCGGTCTC GGTGGTCCAG
CGGGTCGACA CCGACTCCTC GACGGCGTAC GCGGCCACCA GCACGGGGCG GGTCTTCGTC
AGCCACAACG TGGACGCCGA GCCGGCAGCC GCGGTGACCT GGACCCGGAT CGACAACTCG
GACACCCCGA ACCGGTTCGT CACCAGCGTC CACATCGACC CGGCCGACCC GAACCGGGCA
TGGGTGTCCT ACAGCGGCTT CAACTCGAAC ACGCCGGACA CCGTCGGGCA CGCGTTCGAG
GTGACGACCG CCGGTACGAC CGCGACCTGG ACCGACCGCT CGTACGACTT CGGCGACCAG
CCGATCACCG ACCTGGTTCG GGACGACGTC ACCGGTGACC TGTACGCGGC AACGGACTTC
GGCGTGTTGC GGCTGTCCAA GGGCCAGACC AGCTGGGCCA AGGCAGCCTG GGGCATGCCG
AACGTCGAGG TCGCCGGGCT CACTATCGTG CCGGGCGAGC GGATCCTCTA CGCGGCCTCG
CACGGTCTCG GTGCTTGGCA GCTCAAACTG AAGTAG
 
Protein sequence
MKRRRRHLLA GLAVAALVAT TGVAGIHLAG TEVPAAAAAA GDGELPNALG AHLERLRRSV 
PGNDGMSPDG PGGAAEQEFL KRAYPASDVS IAETDRSKAA YAAAERRFRG GQRWTNVGPS
EALYPFTEYR NSFSYVPNEY VAGGRVTSLD ISPGCNRWLC RAYATPSGGG VWGTLNILAA
EPRWFYLGGP LGINAAGSVK IDRNDRTGLT IYVGTGEANI CGSGCVAGVG LYRSTNGGLT
WQGPLGKDVL AGKGIAEITI EPDNPKTMYV ATTTALRGMS SSCCTGVTRP VPDAEKWGLY
KTTDGGKNWN FIHNGSAAAT DCTGDAAEFA NSGVCSPRGV RYVKLDPKDS DVVYASSYGR
GIWRSPDGGA TWAQIKPSLN PDLFQTRAAI DVTALPNGKT RMYVYEGNFG NPYSRLLRSD
DVASGTPTFT DLTSSNPADP GFATYNQCTG QCWYDMFVHT PPGHPDIVYT GGSYSYGETV
AHKRAVILST DAGVSGTDMT FDGTDELHPN GLHPDQHAIV TNPRDPYQFF EANDGGIMRS
SGEFVDRSAW CDDPNRNLTT QAQQDRCKQM LSRIPSKLEG VNKGMNTLQF ISLSVSPHDV
NLLQGGTQDN GTWENKGERR RWVNTMIGDG GASGFDVGKP EFRFHTFFNA TPEVNFNSGD
IADWIWTADP IFGHAGTLFY APVISDPKVS GTMFAGTGRT VYRTKTFGLG DRSIEEANRI
CNTWTGTFEE QCGDWAELGA TPLTDAAWGD RAGGAVSVVQ RVDTDSSTAY AATSTGRVFV
SHNVDAEPAA AVTWTRIDNS DTPNRFVTSV HIDPADPNRA WVSYSGFNSN TPDTVGHAFE
VTTAGTTATW TDRSYDFGDQ PITDLVRDDV TGDLYAATDF GVLRLSKGQT SWAKAAWGMP
NVEVAGLTIV PGERILYAAS HGLGAWQLKL K