Gene Sare_3459 details


Gene Information

Locus tag: Sare_3459
Symbol:
ID: 5708208
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 3988658
End bp: 3989953
Gene Length: 1296 bp
Protein Length: 431 aa
Translation table: 11
GC content: 73%
IMG OID: 641272886
Product: hypothetical protein
Protein accession: YP_001538252
Protein GI: 159038999
COG category: [R] General function prediction only
COG ID: [COG1721] Uncharacterized conserved protein (some members contain a von Willebrand factor type A (vWA) domain)
TIGRFAM ID:
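
The reported gene and protein lengths follow from the coordinates in the record. A minimal sketch in Python, assuming the Start bp and End bp values are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein (the variable names are illustrative only):

```python
# Values copied from the record above.
start_bp = 3988658
end_bp = 3989953

# Assumption: coordinates are 1-based and inclusive, so the span
# includes both the first and the last base.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1296 431
```

Both values agree with the Gene Length and Protein Length fields of the record.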


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.887952
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.027192
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCGCGCCG GGCTGCGAGG GCTGACCACG CGGGGCCGGT CGTTCCTCGC CGCGGCGGTC 
GCCGCGGCGG TCTCGGCCAC CATCCTCGGC GAACGGGACC TGCTCCGGGT GGCCGTGCTG
CTCGCCGTCC TGCCGCTGCT GGCGGCGCTC TACGTCGGCC GCAGTCGATA CAAGCTGGCC
TGCAACCGGT CGCTGGACCC GGGGCGCGTA CCGGCCGGCG CCAGCGCCCG GGTGGTGCTG
CGCCTACAGA ACCTGTCCCG GCTTCCCACC GGGACACTTC TCTTGGAGGA CCAGTTGCCG
TACGCGCTGG GCAGCCGGCC CCGGGTGGTG TTGGACCGAC TCGGCGCGCA CCAGGCCAGT
TCGGTGGCGT ACACGGTGCG GGCCGACCTC CGCGGCCGGT ACACGGTCGG CCCACTGGTG
GTCCGGATGA CCGACCCGTT CGGTCTGTGC GAGCTGACCC GGGCCTTCCC CAGCACCGAC
CAGTTGACGG TGATCCCGCA GGTCTCTCCC CTGCCCATGG TCCGGCTTCC CGGCGAGTAC
GCGGGCAGTG GCGACAGCCG GGCCCGATCG GTGGCGGTGC ATGGCGAGGA CGACGCGGCC
ACCCGGGAGT ACCGGTTGGG CGACGACCTG CGGCGGGTGC ACTGGAAGTC GACCGCGCGC
ACCGGCGAGT TGATGGTCCG CCGCGAGGAG CAGCCATGGG AGAGCCGCGC CACGATTCTG
CTGGACACCC GGGCGTACGG GCACTGTGGC GACGGACCGA CGGCCAGCTT CGAGTGGGCG
GTCGCCGCAG CCGCGAGCAT CGCCGTGCAC CTGCGGCGCA GCGGCTACCG ACTGCGGCTG
GTGACCGGCT CCGGCGCCGA TATCGACGCG ACGGAGACGA CCGGTGACGG ACTGCTTCTG
GAGAGCCTCG CCGACGCCCG ACTCGACCAG CGGATCGAGA TAGCCACGCT GGTACGGAAG
GTTCGTCAGC GCACGGACAG CGGCCTGGTC ATCGGCCTGT TGGGGACGCT GAGCACCGTT
GAAGCCGAAC TGCTGGCCAC CCTGCGCGGT AGCGGCGCCA CCTGCGTGGC GTTCCTGCAG
GACAGCTCCA CCTGGCTGAC CATGCCGACA AGGGCCCGGA CCGAGGCCGA CGACGCACAC
GCCAGTGCCG CGCTCGCGCT GTTGCGCAGT GGCTGGCGTG TGATCGGCGT CGACCGTGGC
TCCCGACTAC CGGCACTGTG GCCGCAGGCC AGCCGGGGCT CGCAGGGGTT CGCCGTCCGG
GCCGCCTCGG CCGAGACCGT GGCCAGCAGA CGATGA
 
Protein sequence
MRAGLRGLTT RGRSFLAAAV AAAVSATILG ERDLLRVAVL LAVLPLLAAL YVGRSRYKLA 
CNRSLDPGRV PAGASARVVL RLQNLSRLPT GTLLLEDQLP YALGSRPRVV LDRLGAHQAS
SVAYTVRADL RGRYTVGPLV VRMTDPFGLC ELTRAFPSTD QLTVIPQVSP LPMVRLPGEY
AGSGDSRARS VAVHGEDDAA TREYRLGDDL RRVHWKSTAR TGELMVRREE QPWESRATIL
LDTRAYGHCG DGPTASFEWA VAAAASIAVH LRRSGYRLRL VTGSGADIDA TETTGDGLLL
ESLADARLDQ RIEIATLVRK VRQRTDSGLV IGLLGTLSTV EAELLATLRG SGATCVAFLQ
DSSTWLTMPT RARTEADDAH ASAALALLRS GWRVIGVDRG SRLPALWPQA SRGSQGFAVR
AASAETVASR R
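
The relationship between the gene sequence, the protein sequence, and the GC content field can be checked with a short script. The following is a sketch under stated assumptions, not the method used to produce this record: the helper names (`clean`, `translate`, `gc_content`) are hypothetical, the codon table is the standard amino acid assignment (which translation table 11 shares), and only three common bacterial initiator codons are treated as start codons, each read as M at the first position.

```python
from itertools import product

# Standard amino acid assignments in NCBI order (bases T, C, A, G).
# Translation table 11 uses the same assignments; it differs in start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}

# Assumption: a subset of the initiator codons allowed in table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq):
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an initiator codon is read as methionine
        else:
            aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(seq):
    """Return the percentage of G and C bases."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First 30 bases of the gene sequence listed above.
first_bases = "GTGCGCGCCG GGCTGCGAGG GCTGACCACG"
print(translate(first_bases))  # prints: MRAGLRGLTT
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_content` would allow the same comparison against the complete protein sequence and the GC content field.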