Gene Sare_2644 details

Gene Information

Locus tag: Sare_2644
Symbol:
ID: 5703589
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 3011238
End bp: 3014168
Gene Length: 2931 bp
Protein Length: 976 aa
Translation table: 11
GC content: 63%
IMG OID: 641272102
Product: hypothetical protein
Protein accession: YP_001537472
Protein GI: 159038219
COG category:
COG ID:
TIGRFAM ID:
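
The coordinates and lengths in the table above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are both inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of the database.

```python
# Values copied from the Gene Information table.
start_bp = 3011238
end_bp = 3014168
reported_gene_length = 2931      # bp
reported_protein_length = 976    # aa


def gene_length(start, end):
    """Length of a feature whose start and end coordinates are both inclusive."""
    return end - start + 1


def protein_length(cds_length):
    """Residue count for a CDS, assuming it ends in exactly one stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


computed_gene_length = gene_length(start_bp, end_bp)
computed_protein_length = protein_length(computed_gene_length)

print(computed_gene_length == reported_gene_length)
print(computed_protein_length == reported_protein_length)
```

Under these assumptions both comparisons hold: 3014168 - 3011238 + 1 = 2931 bp, and 2931 / 3 - 1 = 976 aa.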


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00661972
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
GTGACCGTAC TATCCATGGC GGGACTGTCG ATGTCGATGC CAGTGGCCGG TGCCGAGCCG 
GCAGCGGCAG AACCAGTCCG GCCGGGCGCG AAGGCGGGCT TCACCGCCCC GGACAGGGCG
CTGGGCGAGG ACTGGAAGAC CTCCAGCGAC ATCCTGGTCA CGGGGGCCGG TGACACCGAG
GGCTATCACC TCTACGTCGC AAGAGAGTCG GCAGCGTTCG GGTGGGCCAC CCTTGCCACC
CTGACATCCA GCGCGATCTC CGTCGGCCCC TGGACCGGCA ACGTCTGCGT GACCGGTTCA
GGGCGCTACG CGATCGCCGT CTTCGCGCCG AAGAAGGCTG CCAACAAACC GGAACTGTCC
CGTGCGGGAG CTCTGGCGGC CGTCGTGGAG ATCGCCACCG GCAAAGCGAC ACACGTCGCA
ACCGGGGTCG AACTGGCGTA CTTCAATCCA GGGTGTGGTC CCGACGACCG AGCGTTGCTG
ACCCGTTCGC TGGGTGACGA CTTCACGCTG TCAACCGAGT TACTGACAGT CGATGCAGCG
AGTGCCAAGG TCATCGCGAC TCGCCGAGTC GAGGGCCAGC TCACGACGCC CGCTCCCGGG
CCGAAAGGCG ACTACGGAAT TCTCGGCCGC CATCTGGTCC AGATTGACGA ACAGGGTCAG
GCGATCCGAC AGGCCAGCCT GACCGGGCAG CCCTTCGGAC TTCAGGCCAC CGCGCGAAAT
GGCGTCGATC TGGTCGCCGT ACACGGTGAC CGGGCGCTCG CCCAGCGATT CCACGATGGA
AGACTGACCA CTGTCGCCAC CGGCCCTTGG GACAAGCTCC AGTTGTTCGG GCAACAGGGC
GGTCACAACG CCCTGGTGGG TCAGGTGCAC GCACGTGCCA GCATGCCCGA GTTGAGGGTG
GTGCAGACCG CGCGTCAGAT TCGAGCGCTG TCCGAAGAGG GACACCTTGC CGCGGCAGAG
GTAGTCAGCC AGCAGAACAT GCGGGCGGCG GGTCAGCCGC TGCTTCCAGC GGACCCAGCA
GACGCAGGTG ACGTGCGCGT TTCCGTACAG GCGACGGCTA CCGGCGAGAA GGCCACCCGC
ACGTTCAATA CGACGGCCGC GCCCACACTC GATGTCGCGC TGAACACGAA TGCCCCCGCC
GAAGCAGGGA CATCCCGGGT GGTCGATTCT GAGGCGTTGA CGCCAACCTG CGCGGTGCCG
CGTAACGACC CCCGTGTGCA GCCTCTCCAA CCAAGTCCAG ATATGGTGGA GTGGGCCGTG
AACCAGGCAG TGCACGGCCG GCTCAACGTC AACCGCCCGG CGAACTACCT GAAGGCCGGC
CTACCGGCCT ACCAGCCACA ATCTTTGTTT CCAGCCCGTA CCCTGATAGG GGGCGGAAAG
GTCCCAGCGC AGGTCATGCT TGCCATCCTG GCGCAGGAGA CGAACCTGTC CCAGGCATCG
TGGCACGTGG TGCCTGGTGA CACGGGTAAC CCCCTGATCG CCAGCTACTA CGGCAACCAC
GACAACCTCG ACGTGATTGA CTACAGCAAG ACCGACTGCG GGTACGGCAT CGGTCAGGTC
ACCGACGGCA TGAGGGTAGG AAGTGCGCTG TTCACCGAAA CGCAGCGCAG GGCGATCGCG
GTGGACTACG CGGCAAATAT CGCCGCCGGA ATGAACATCC TGATCGAAAA GTGGAATCAG
ATGGCCGGCG AGTTGTCCGC GCACCAGAGC TACATGAACA ACAACGATCC GGCCTTCGTG
GAGAACTGGT TCCTAGCGGC TTGGGCCTAC AACAGCGGAT ATTATCCGTA CACTACTCGC
AACAGCGAGC TACAAAACGG CCGGTATGGT ATTGGCTGGT TCAACAATCC CGCCAATCCT
CGATACCCCG CGAACCGCGC ACCATTCCTG CGGTTGACCC CAGCTGATGC GGAACGCCCC
AACGAATGGG CTTACCCGGA GCGAATCATG GGCTGGGCGG AAACACCGCA GCTCAAAGGA
TTTCCGGTAA TGACGCAGGC GTACGCGGAG CCGGATCATG GGGCGAACTC GCCGCGCACC
GGACCTCAGG GGATCAATCA GGTTCTCTCA ATTCCGGACC GATACGAGTT CTGTTCGACG
GTCAACAACT GCTCCGAAGC CACCAACGGA TGCCCGGCCG AGTCGGAGTT GTGCTGGTGG
CACGGCGCGG CGAATTCGGG CAACTGTCCG ATGGACGAGT GCGCGAAGGA AAAGCTCACC
TTCAGTGCAG GCGCCCCCGA GCCAGGAGTG AAGCGGATCT ATGAACGCAA CTGCGAGACG
TTCACTGGCG AGAAAAACGG AAACCGGGAT CCAAGCCGAG ACGTCTCCGT GGTCTATACG
TTGAACGACA CCGGACAGTA CAATCTCGGA TGCGACATTG GTGAGTCTGA CGGCAAGTTC
ACGATTCGCC GGGGCCATCC GGCCGGCAGC GGCAGCAGCG CACCCTACGC CGAGATCGAT
CTCCACCAGA TCGGTGCCGG CTACAAGGGA CACATCTGGT ACACGTACGT CAACCCGGGA
AATCCCAAAC GCCGGATAGT TGGGTCCTGG ACCCCGAATC TCGATCTGGC TCCAGGAGAG
AAGGCCCGCT ACGACATCGT CGCCCACGTA CCCAGCCACG GAGCCGACTA CGACGCGGTG
GAATACCTCA TCACCCGAGG AGCCATCCTG GGGCAGGCGA CATGTGATAT CGATTTCGCC
GAGGAGGCTG GCTGGTCGGT CTGGCCAGGC GTCCCTGACC CTAACCCTTT CAACCTGGGC
GAGGATAAGT GGGTCTACCT GGGTTCCTAC GAGTTGGGCC GGGGTGCTCA GGTCCAGTTG
AATAACATCG GTAACGAGAC TATCAATGGC TTCGACGCGG TCGCGTTCGA CGCAATGGCG
TTCGTCCCGA TCGGGAACAA CCCGGGGCAC TCATGCGGTG ACGACTACTA G
 
Protein sequence
MTVLSMAGLS MSMPVAGAEP AAAEPVRPGA KAGFTAPDRA LGEDWKTSSD ILVTGAGDTE 
GYHLYVARES AAFGWATLAT LTSSAISVGP WTGNVCVTGS GRYAIAVFAP KKAANKPELS
RAGALAAVVE IATGKATHVA TGVELAYFNP GCGPDDRALL TRSLGDDFTL STELLTVDAA
SAKVIATRRV EGQLTTPAPG PKGDYGILGR HLVQIDEQGQ AIRQASLTGQ PFGLQATARN
GVDLVAVHGD RALAQRFHDG RLTTVATGPW DKLQLFGQQG GHNALVGQVH ARASMPELRV
VQTARQIRAL SEEGHLAAAE VVSQQNMRAA GQPLLPADPA DAGDVRVSVQ ATATGEKATR
TFNTTAAPTL DVALNTNAPA EAGTSRVVDS EALTPTCAVP RNDPRVQPLQ PSPDMVEWAV
NQAVHGRLNV NRPANYLKAG LPAYQPQSLF PARTLIGGGK VPAQVMLAIL AQETNLSQAS
WHVVPGDTGN PLIASYYGNH DNLDVIDYSK TDCGYGIGQV TDGMRVGSAL FTETQRRAIA
VDYAANIAAG MNILIEKWNQ MAGELSAHQS YMNNNDPAFV ENWFLAAWAY NSGYYPYTTR
NSELQNGRYG IGWFNNPANP RYPANRAPFL RLTPADAERP NEWAYPERIM GWAETPQLKG
FPVMTQAYAE PDHGANSPRT GPQGINQVLS IPDRYEFCST VNNCSEATNG CPAESELCWW
HGAANSGNCP MDECAKEKLT FSAGAPEPGV KRIYERNCET FTGEKNGNRD PSRDVSVVYT
LNDTGQYNLG CDIGESDGKF TIRRGHPAGS GSSAPYAEID LHQIGAGYKG HIWYTYVNPG
NPKRRIVGSW TPNLDLAPGE KARYDIVAHV PSHGADYDAV EYLITRGAIL GQATCDIDFA
EEAGWSVWPG VPDPNPFNLG EDKWVYLGSY ELGRGAQVQL NNIGNETING FDAVAFDAMA
FVPIGNNPGH SCGDDY
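
The gene sequence starts with GTG while the protein sequence starts with M, which is consistent with translation table 11, where GTG is one of the permitted initiator codons. The sketch below is one way to check this on the first line of the gene sequence. It assumes the sequence is written in space-separated blocks of 10 bases, as on this page, and it builds the codon table by hand instead of using a library. The helper names (`clean_sequence`, `gc_fraction`, `translate`) are illustrative only.

```python
from itertools import product

# Codon-to-amino-acid mapping for NCBI translation table 11, in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}
# Initiator codons listed for table 11; each is read as M at position 1.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean_sequence(text):
    """Remove the spaces and line breaks used to block the sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in a cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate a cleaned CDS, stopping at the first stop codon.

    Only A, C, G and T are handled; any other symbol raises KeyError.
    """
    protein = []
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    for i in range(0, usable, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an initiator codon always yields methionine
        protein.append(aa)
    return "".join(protein)


first_line = clean_sequence(
    "GTGACCGTAC TATCCATGGC GGGACTGTCG ATGTCGATGC CAGTGGCCGG TGCCGAGCCG"
)
print(translate(first_line))  # MTVLSMAGLSMSMPVAGAEP
```

The printed residues match the first 20 residues of the protein sequence. Passing the full cleaned gene sequence to `gc_fraction` would give a value to compare with the reported GC content.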