Gene Sare_3253 details

Gene Information

Locus tag: Sare_3253
Symbol:
ID: 5703735
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 3746405
End bp: 3747382
Gene Length: 978 bp
Protein Length: 325 aa
Translation table: 11
GC content: 75%
IMG OID: 641272681
Product: hypothetical protein
Protein accession: YP_001538048
Protein GI: 159038795
COG category: [R] General function prediction only
COG ID: [COG1721] Uncharacterized conserved protein (some members contain a von Willebrand factor type A (vWA) domain)
TIGRFAM ID:
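
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative only and are not part of the record.

```python
# Values copied from the record above.
start_bp = 3746405
end_bp = 3747382

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)      # 978
print(protein_length)   # 325
```

Under these assumptions the computed values agree with the Gene Length (978 bp) and Protein Length (325 aa) fields.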


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.584658
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.947006
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACCTCAC CTGCCCGTCG GCCCCCCGAC GCCACCCCCC GCGATCGCAC CGAAGCCGTC 
CTGTCCCGGC TGCACCTGCT CGTCACCCGC AAGCTCGACG GCCTGCTCCA AGGCGACTAC
GTCGGCCTGC TGCCCGGTCC CGGCAGCGAG GCGGGGGACT CGCGCGAGTA CCGCCCGGGC
GACGATGTTC GGCGGATGGA CTGGCCGGTC ACGGCGCGCA CCACGATGCC GCATGTACGG
CGTACGGTGG CCGACCGGGA GCTGGAGACG TGGCTCGCGG TGGACCTCTC GGCCAGTCTC
GACTTCGGAA CCGGACGGTG GCTCAAGCGC GACGTCGTGG TGGCCGCCGC CGCGGCACTC
GCCCACCTGA CCGCCCGGGG TGGCAACCGG GTCGGCGCGG TCATCGGCAC CGGGAGTGAG
CCGCCTGGGG GCGGGCGGCG TGCGCCGGCA GCCAGGGGTG GCGGGTTCAC CCGGTTGCCG
GCCCGGTCGG GCCGTCGGGA GGTGCAAGCC CTGGTCCGGG CGGTGGCCGG CACCGAGATC
CGGCCCGGGC GCAGCGACCT CGGTGCCCTC GTCGACCTGC TGAACCGGCC ACCCCGGCGG
CGTGGGGTGG CGGTCGTCGT CTCCGACTTC CTGGCGCCGC CGGCCCAGTG GACCCGCCCG
CTGCGCAAGC TGCGGGTACG TCACGACGTG CTGGCCATCG AGGTGCTGGA TCCGCGTGAG
CTGGAGCTAC CCGACGTGGG CGTCCTGCCG GTGGTCGACC CGGAGACCGG CGAGTTACAC
GAGGTGCGGA CCGGCGACCC GCGGCTACGT CACCGTTACG CCGAGGCGGC TGCCGCCCAG
CGGGCGGAGA TCGCCGCGGC GCTGCGTGCC GGGGGCGCCG CACACCTGAG GCTGCGGACC
GACCGAGACT GGCTGCTGGA CATGGTGCGT TTCGTTGCCG CGCAGCGGCA CGCCCGCACC
CGAGGGACGA CACGATGA
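
The gene sequence is printed in space-separated blocks of ten bases, so it needs cleaning before any computation. The sketch below shows one way to strip the layout and compute a GC fraction of the kind reported in the GC content field. The helper names `clean_sequence` and `gc_content` are hypothetical, and the demonstration uses only the first line of the sequence, so its result is not the whole-gene figure.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_content(raw: str) -> float:
    """Return the fraction of G and C bases in the sequence (0.0 to 1.0)."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence above, used only as a small demonstration.
first_line = "GTGACCTCAC CTGCCCGTCG GCCCCCCGAC GCCACCCCCC GCGATCGCAC CGAAGCCGTC"
print(len(clean_sequence(first_line)))
print(round(gc_content(first_line) * 100, 1))
```

Passing the full gene sequence text to `gc_content` would give a value to compare with the reported 75%.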
 
Protein sequence
MTSPARRPPD ATPRDRTEAV LSRLHLLVTR KLDGLLQGDY VGLLPGPGSE AGDSREYRPG 
DDVRRMDWPV TARTTMPHVR RTVADRELET WLAVDLSASL DFGTGRWLKR DVVVAAAAAL
AHLTARGGNR VGAVIGTGSE PPGGGRRAPA ARGGGFTRLP ARSGRREVQA LVRAVAGTEI
RPGRSDLGAL VDLLNRPPRR RGVAVVVSDF LAPPAQWTRP LRKLRVRHDV LAIEVLDPRE
LELPDVGVLP VVDPETGELH EVRTGDPRLR HRYAEAAAAQ RAEIAAALRA GGAAHLRLRT
DRDWLLDMVR FVAAQRHART RGTTR
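
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The following is a minimal sketch, assuming that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code and that an initiator codon such as GTG is read as methionine. The `START_CODONS` set is a deliberately small subset chosen for illustration, and the function name `translate` is hypothetical.

```python
from itertools import product

# Standard codon assignments, listed in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: an illustrative subset of the initiator codons used in bacteria.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(raw: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    seq = "".join(raw.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # the initiator codon is read as methionine
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("GTGACCTCAC CTGCCCGTCG GCCCCCCGAC"))  # MTSPARRPPD
# Last line of the gene sequence above, which ends in the stop codon TGA.
print(translate("CGAGGGACGA CACGATGA"))  # RGTTR
```

The two fragments match the first ten and last five residues of the protein sequence listed above.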