Gene Sare_4159 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_4159 
Symbol 
ID5707708 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp4723511 
End bp4724716 
Gene Length1206 bp 
Protein Length401 aa 
Translation table11 
GC content63% 
IMG OID641273586 
Producthypothetical protein 
Protein accessionYP_001538939 
Protein GI159039686 
COG category[S] Function unknown 
COG ID[COG2311] Predicted membrane protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.776458 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000152397 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGGTTGCCG TAGTTCCAGA TGTAGGTACT CGGGCGACCC CTGTCCGACT GCGCCTGCTG 
GGACCCGACC TGGCGCGCGG GGCCATGCTG CTGTTGATTG CACTGGCCAA TGTTGATGTC
TTCGCTTTCG GCTTCTTGCC AGGATTCCGG GGGTACCCCG CCGAGCAATC TATACTCGAC
AGTATTTTCA CCATGGCGCG GATGGTGTTG GTTGACGGCC GGGCCTTTCC GCTTTTCGCC
GCGTTGTTCG GCTATGGCCT CGTACAACTG CTGCAGAACC GGGGTTCTGC AGAACCCGCG
CGGTCTTCGC GGCTGTTGCG TCGTCGTGGC ATGTGGTTGG TGGTTATCGG ATTCGTTCAC
GGCATGCTTT TGTTCACGGC GGACATCGTT GCGCTCTACG GGTTGTGCGC CCTCGTCTTC
GCTGGGCTCG TGGTACGTCT CAGTGACCGT GGCCTGCTGA CCGTAGCCCT GTCGCTGGTG
GCACTCGCCC TGCTAACCGG TGCCGTCCGC GGACTGCCTG CGGAGGCTCT CGGCCAGGCA
GGCGTAGTCA CAGCGACACC GACCATATTC GGTGGTGACG TCGTCGGCGC GCTGCAGGCG
AGGATGAGTG AGTGGGCCTT GGGAGCGATA CGCCTATTCG GACTGATGCC GGCGGTGCTC
TTCGGTGTCT ACGCGGGACG TAAGTCAGTC TTGACCTGGG GGCCAGAGCG GAAGCGGGTA
CTTGGTCTGG TCGCGTTTAC CGGGCTGGCC GCCGGTATCG TTGCTGGGGT TCCTTCGGCG
CTGATGGCGG CATCGGTGTG GACTGACCCG CCAATCGGTA TCGCCGCGAT TGCGGGAACG
CTTCACCTGG CGGGCGGGTA TGCGGCTGCA GCCGGCTATC TGGCCCTATT TGCCCTACTC
GCGGCCACCG TGCGGCGGCC TCCAGGCCTG ATAGTGAAGG CGCTATCGGT GAGTGGGCAA
CGCTCCTTGA CGCTGTATCT TAGCCAGTCC TTGCTGTTCC TCGTTCTCTT CGATCCGGAC
TTTTTTGGGC TGGGTGACAG ATTCGGTATT GCGTTGAACT CTGCTGTGGC TGCCGGTGTC
TGGATCGTCG GCGTTCTCAG TGCGCTGTTG ATGGACAGGC TGTCCATCCG TGGCCCGGCC
GAGGTGCTGC TACGCAGTCT CACCTACTGG CCGGTGGCCA GATCAGCCGG TCCGCGTTCT
CGTTGA
 
Protein sequence
MVAVVPDVGT RATPVRLRLL GPDLARGAML LLIALANVDV FAFGFLPGFR GYPAEQSILD 
SIFTMARMVL VDGRAFPLFA ALFGYGLVQL LQNRGSAEPA RSSRLLRRRG MWLVVIGFVH
GMLLFTADIV ALYGLCALVF AGLVVRLSDR GLLTVALSLV ALALLTGAVR GLPAEALGQA
GVVTATPTIF GGDVVGALQA RMSEWALGAI RLFGLMPAVL FGVYAGRKSV LTWGPERKRV
LGLVAFTGLA AGIVAGVPSA LMAASVWTDP PIGIAAIAGT LHLAGGYAAA AGYLALFALL
AATVRRPPGL IVKALSVSGQ RSLTLYLSQS LLFLVLFDPD FFGLGDRFGI ALNSAVAAGV
WIVGVLSALL MDRLSIRGPA EVLLRSLTYW PVARSAGPRS R