Gene Sare_3716 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_3716 
Symbol 
ID5705509 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp4268511 
End bp4269725 
Gene Length1215 bp 
Protein Length404 aa 
Translation table11 
GC content66% 
IMG OID641273136 
Productportal protein 
Protein accessionYP_001538500 
Protein GI159039247 
COG category[S] Function unknown 
COG ID[COG4695] Phage-related protein 
TIGRFAM ID[TIGR01537] phage portal protein, HK97 family 


Plasmid Coverage information

Num covering plasmid clones144 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGGATCT GGTTCCGTCA GACCGAGCGC CGCCTCAACG GCTTCTCGAT CACGTCCTGG 
GCATCCACCG CGCCTGTGTC GCCGGCCGCC GGCGCCCGTA ACTCGCTGGC CGCGCTGACC
GCCTCATCTG AGGTGAGCTT GCAGACAGTG GCCATGTGGT CGGCCGCCGA CCTGATCTGT
TCGCTCGTCT CCGAGCTGCC GATCGACGTG CTCCGTGGGT CCGGAGCCGG CAAGAAGATC
CTTGCGGTGC CGTCGTACCT GCAGGACCCG GGTGGCGAGG GGTACGGCGT TGAGGACTGG
ATCTCGCAGC TACTCATGTC GTGGGTGCTG CGCGGCAACA TCAACGGGCG GGTACTCGGC
TGGTCGCGTG ACTCCGACAG CGCCTGGCCG ACACAGATCC AACTGCTACA CCCGGACTCG
GTAGCCGGCT GGTACGACGA GGGCGGCCGT CCGGTGTGGC GTGTGGCCGG CGGCGACATA
CCGCCCGGGC AAATGTGGCA TCGCCGGGTG CACACCGTTC CCGGTCGGCT GATGGGCCTG
TCGCCGGTGC AGCAGCAGGC CGCCACGCTC GGTCTCACGC TCAGCGCTAC CCGCTTCGGC
CTCGACTGGT TTACCGAAGG CGCCCACCCG TCGGCGGTGA TGCAGTCGAA GTCCCGCAAC
CTAGGCCCCG ATGGCGCGAC GACCGCGAAA CAGCGGTTCA TGGCGGCGAT CCGCGGCCGT
GAACCCGTGG TGATCGATGG GGACTGGCAA TACACGCCAA TCCAGATCAA CCCCGAGGAG
TCCCAGTTCC TGGAGACCAA CAAGTTCTCG CAGGCCCAGG TCGCGCGGAT CTTCGGCCCG
GGAATGGCCG AAATCCTCGG CTACGAATCC GGCGGATCAT TGACCTACTC CACTGTTGAG
GGACGCTCGC AGCACCTGTT GGTGTACGCG CTGAACAAGT GGATGCGCCG GGTCGAACGT
GTGCTCAGCT CGATGCTGCC GCGTGGTCAA TGTGCTCGTC TCAATCGCTC CGCCCTGCTG
GAGCCGACCG TGCTCGACCG GTGGCGCGTC TACCAGATTC AGCTTGCTAC CAAAGCGCGG
GCGATCAACG AGGTGCGCGA CGACGAGGAC TGGACGCCCG TGCCATGGGG CGAACAACCA
GCCGTGTCGA TGCCCACTCA ACAGCCGGCC GATCCCGACC CGAACGAACC GATGGGAGAC
CCCAGTGCGA AGTAA
 
Protein sequence
MGIWFRQTER RLNGFSITSW ASTAPVSPAA GARNSLAALT ASSEVSLQTV AMWSAADLIC 
SLVSELPIDV LRGSGAGKKI LAVPSYLQDP GGEGYGVEDW ISQLLMSWVL RGNINGRVLG
WSRDSDSAWP TQIQLLHPDS VAGWYDEGGR PVWRVAGGDI PPGQMWHRRV HTVPGRLMGL
SPVQQQAATL GLTLSATRFG LDWFTEGAHP SAVMQSKSRN LGPDGATTAK QRFMAAIRGR
EPVVIDGDWQ YTPIQINPEE SQFLETNKFS QAQVARIFGP GMAEILGYES GGSLTYSTVE
GRSQHLLVYA LNKWMRRVER VLSSMLPRGQ CARLNRSALL EPTVLDRWRV YQIQLATKAR
AINEVRDDED WTPVPWGEQP AVSMPTQQPA DPDPNEPMGD PSAK