Gene Sare_3103 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_3103 
Symbol 
ID5706577 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp3525823 
End bp3527433 
Gene Length1611 bp 
Protein Length536 aa 
Translation table11 
GC content60% 
IMG OID641272537 
Producthypothetical protein 
Protein accessionYP_001537905 
Protein GI159038652 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.186273 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0036434 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCGATATA AATCCAACGG CTTGGGTCGG AAGGCCCTGT CCATATTTCT CGCTTTTACT 
GTAGCGTCCC TAACGTTCAT CATCGCTGAT ATCCGGATGT CGCTGCCGGT TCAAGCGTCG
TCCACGGTCG GCGGAAGCAT TTCCCGTACC GAAGTGCTGA CCCGGGCGGA GTGGTGGGTC
AATACGTACG GCGTCATCTA TAGCCAAAAT CAGAATGACC AGAAGCCCGA CCCGGATGGA
CACCCCTACC GACCGGACTG CTCGGGATTC ATCTCGATGG CCTGGCACCT GCCAAAGAAG
AGTGACGGCT GGGATCGCAA TACTGGTGAC CTTGATGCCT TCGGTGACAC CACCTACCTC
AGCAACCTTG GGGAACTCCT TCCAGGCGAT GCGATCCTTG GCAAGAGTTA CGGGCACGTG
GCGCTCTTTG ACCGGTGGGC CAACCCGTCC CGTACTGAAA TGTGGATCTA TGATGAATAC
AAATCTGGAC GAGAGGGGAG GCACATCATC CAGTCGAGGA GTTGGTACGA GAGCGAGGGC
TTTCGTGGAC TGCGATACAA CAAAATCACC TCGATGATGC CGGATGCGCC GGATGCGGTG
TCGCGGGATG GGGTGGTGGT GTCGTCGTCG GGGCGGATTT CGGTGTATGC GGTGCGTGCT
GATGGTGATG TGTGGGGTCG TAGTCAGGAA TCGCCGGGTG GTTCGTTCAA TGCGTGGCAG
CGTTTGTCGA CCGGTGGTGG TTTTGCTGGT CAGGTAGCGG TGTTGCGGGA TGATCGTGAC
CGGGTGGCGT TGTATGCGCG GCGGAGTGGG ACGATATTCG GGGCGAGTCA GCAGGAAGTT
GGTGGATCGT TTGGTGTGTG GGGTCCGATC GGTACGAACG GTGCGGGGGT GACGGGGGAT
CCGCGGGCGG TGTATGCGTC TGAGGGGCGG ATCGCTATCT ATGCGACGAC GAGTAGTGGG
AATGTGTCGG GAGTGACGCA GACGCAGGCT GGTGGTGGGT TCGGTTCATG GCAGCAGTTG
ACCAGTGGTG GTGGCTACAT GGGTAAGCCA GCGGCGGTGG TGGATTCTCA GCAACGGGTG
GCGTTGTATG TGCGTCGGAA CGGCATGGTC TATGGGGCCA GTCAGTCGCA GGCTAACGGT
TCATTTGGGA CGTGGGCTGC CCGGGGTGTT GATGGTGCGG GTGTGGCCAG TGATCCGGTG
GCGGTGTATG GGGTCGGGGG TAGGATTGCT ATTTATGTCA CCAGCACTGC GGGGAACGTT
GCTGGGGTCA ATCAGGTAGC CGCTGGTGGT GAGTTCGGTG CTTGGCAGGT GTTGACCAGC
ACGGGTGGGT ATGAGGGCCG GCCGGCGGTG TTGGTTGACG AGCAGGGTCG GGTAGCGGTC
TACGTGCGTC GAAGTGGCGC GATCTACGGC GCTAGTCAGC CCGAGGCCGG TGGTCCGTTC
GGTGCCTGGG CTGCTCGTGG CACCGGTAGT CCCCAACTCA TCGGTGATCC CACTGCTGTG
TATGGCGTTG GTGACCGAAT CGCCCTGTAT GCCGCCGCTA CCAACGACAG TATCGGCGGT
GTTAGCCAGG GCGAAGCCGG CGGCACCTTC GGCAACTGGA TCGTCCTGTA G
 
Protein sequence
MRYKSNGLGR KALSIFLAFT VASLTFIIAD IRMSLPVQAS STVGGSISRT EVLTRAEWWV 
NTYGVIYSQN QNDQKPDPDG HPYRPDCSGF ISMAWHLPKK SDGWDRNTGD LDAFGDTTYL
SNLGELLPGD AILGKSYGHV ALFDRWANPS RTEMWIYDEY KSGREGRHII QSRSWYESEG
FRGLRYNKIT SMMPDAPDAV SRDGVVVSSS GRISVYAVRA DGDVWGRSQE SPGGSFNAWQ
RLSTGGGFAG QVAVLRDDRD RVALYARRSG TIFGASQQEV GGSFGVWGPI GTNGAGVTGD
PRAVYASEGR IAIYATTSSG NVSGVTQTQA GGGFGSWQQL TSGGGYMGKP AAVVDSQQRV
ALYVRRNGMV YGASQSQANG SFGTWAARGV DGAGVASDPV AVYGVGGRIA IYVTSTAGNV
AGVNQVAAGG EFGAWQVLTS TGGYEGRPAV LVDEQGRVAV YVRRSGAIYG ASQPEAGGPF
GAWAARGTGS PQLIGDPTAV YGVGDRIALY AAATNDSIGG VSQGEAGGTF GNWIVL