Gene Sare_3474 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_3474 
Symbol 
ID5708076 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp4006327 
End bp4007454 
Gene Length1128 bp 
Protein Length375 aa 
Translation table11 
GC content73% 
IMG OID641272901 
Productglycosyl transferase family protein 
Protein accessionYP_001538267 
Protein GI159039014 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1215] Glycosyltransferases, probably involved in cell wall biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.288784 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0309659 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCCTGT TGCTGATGGT GCTGGCCGGC GTGGCCGCGC TGACCGCGCA CACCCTGGTC 
AACGCCGGTC GTTGGCTGCG CCGCCCGACT GGGACGCCAG CCGAGGTGAC CGAGCCGGTG
GCGGTGCTGC TGCCGCTGCG CGACGAGGCC ACCCGGGTAA CCCCGTGCCT ACGCGCGCTG
CTGGCTCAGC GTGGTGTGCC GGAGCTACGG ATCGTGGTGC TCGACGACGG GTCGACCGAC
GGCACCCGCG AGGTCGTCCA CGCGGTCGTC GGGGACGACC CACGCGTCAC CCTGCTCGAC
GGGGGCGCCC CACCCCCCGG CTGGCTGGGC AAACCACACG CCTGCTGGCA GCTCGCCACC
CGGGCCGACC CGGCCGCCAC CGTGCTGGTC TTCGTGGACG CCGACGTCGT GCTCGCCCCA
CACGCCGTGG CCGCGGCCGT CGGCGAACTG CGCGCCGCGC GGGTGGCACT GCTGTCGCCG
TACCCCCGAA TCCTGGTCAC GACGGTGGCC GACCGGCTGG TCCAGCCGCT GTTGCAGTGG
TTGTGGCTGA CGTTCCTGCC GCTGCGCGCG ATGGAACGGT CGGCCCGACC GTCTCTGGCC
GCGGCCGGTG GGCAGTTCCT GGTCGTGGAC CGGATCGGTT ACACCGCCGC GGGCGGGCAC
GCGGCGGTGG CCGACCGGAT CCTGGAGGAC GTCGAACTGG CCCGGGCGGT CAAACGGTCC
GGCGGCCAGA TAGCTCTCGC CGACGGCTCA CGGCTGGCCA CCTGCCAGAT GTACGACGAC
TGGCCGCAGT TGCGGGACGG CTACTCCAAG TCGCTGTGGG CCTCGTTTGG CCGTCCTACA
GCGGCTGCCA CGGCGGTAGC GGTGCTCCTG CTGCTCTACA CCGCCCCACC GCTGGTCGCT
GTGGTCACGT GGGCCAGTGG TGCACCGGGG ACGGCCACCG TCGCCGCCGG GACGTACCTG
CTCGGGGTCG CCGGACGGGT GGTCAGCGCC CGAGCGACCG GCGGCCGGTG GTGGCCTGAC
GCGTTGGGCC ACCCCGCGTC GGTCGCGGTC CTCGGTTGGC TGACCCTGCG GTCGTACCAT
CTGCGGAAGC AACGGCGCCT GAGCTGGCGG GGCCGTCCGG TCGTCTAG
 
Protein sequence
MILLLMVLAG VAALTAHTLV NAGRWLRRPT GTPAEVTEPV AVLLPLRDEA TRVTPCLRAL 
LAQRGVPELR IVVLDDGSTD GTREVVHAVV GDDPRVTLLD GGAPPPGWLG KPHACWQLAT
RADPAATVLV FVDADVVLAP HAVAAAVGEL RAARVALLSP YPRILVTTVA DRLVQPLLQW
LWLTFLPLRA MERSARPSLA AAGGQFLVVD RIGYTAAGGH AAVADRILED VELARAVKRS
GGQIALADGS RLATCQMYDD WPQLRDGYSK SLWASFGRPT AAATAVAVLL LLYTAPPLVA
VVTWASGAPG TATVAAGTYL LGVAGRVVSA RATGGRWWPD ALGHPASVAV LGWLTLRSYH
LRKQRRLSWR GRPVV