Gene Sare_4060 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_4060 
Symbol 
ID5704143 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp4617004 
End bp4618110 
Gene Length1107 bp 
Protein Length368 aa 
Translation table11 
GC content70% 
IMG OID641273486 
Productglycosyl transferase group 1 
Protein accessionYP_001538841 
Protein GI159039588 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.586915 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.234234 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCGCGCA ACCGCGGGGA CAACAGCTCG GCCGGACGGC CACTACGGAT CGCGATGGTG 
GTCCCGCCGT GGCTGTCGGT GCCGCCGCCC GGCTACGGCG GCTTGGAGCA CGTGGTCGCC
GGCCTGGTGG ACGGGCTGAT CGCCCGGGGC CACACGGTGA CCCTGTTCGG GGCGGGTGAG
CGGACCGACA CCGCCGCCCG TTTCGTCTCG ACCGACGCCG AGCTGAAGTT CCAGCGGATC
GGCGAGGCAC TGCCCGAACT GGCCCACCTC GTTCAGGTGA ATCAGCTCGT TGGTCCGGAG
CAGTTCGACG TCGTTCACGA CCACACCACG ATCGGTCCCC TGCTGGCCGG GCGGCGGGCG
GTGCCCACCG TCGCCACCGT GCACGGCAAT CCGGTCGGGG AGTACGGGAC CGTACTCGGT
GACATCGACC GGGGCGTGGG CCTGGTAGCC ATCTCCCACG CCCAACGGCG GCTCAACCTG
CGGCTGCCGT GGGTCGGCAC GGTGCACAAC GCGCTGGACG TTGACGACAT CCCGCACAAG
CGGACACCGA GCCACGGGCC GGTGCTCTGG CTGGCCCGGT TCAGTCCGGA CAAGGGTCCC
GACCTCGCCA TCCGCGCCTG CCGGAGCGCC GGCCTGCCGT TGGTGCTCGC CGGAAAGTGC
AACGAACCGG ACGAACGCCG CTACTACCAC GACGTGGTGC GGCCGATGCT GGGCGACGAC
ATCACGGTGG TCCTCGACGC TGACCGGCGG GACGCGTTCC GCCTGCTCCT CGAAGCCCGA
TGCCTGGTCA TGCCGATCCA GTGGGAGGAA CCGTTCGGCA TCGTCATGCT GGAGGCGATG
GCCACCGGAA CCCCGGTGGT GGCACTACGC CGGGGTGCCG TGCCGGAGCT GGTCGTGCCC
GGCCGCACCG GTCTGATCTG CGAGCACGTG GACGAACTGC CGGGGGCGCT GCGCGCGGCG
AGTCGACTGG ATCCGGGCGT GTGCGTCGCC CATGTGGTGG AGAACTTCTC CACCGCCCGG
CTGGTTGATG GCTACGAGAC AGTGTTCCAG CGGTTCGTCT CGGCAGTGGT CCCGGCACGG
GAACCCGCCC CCATCACGTT CCGTTGA
 
Protein sequence
MARNRGDNSS AGRPLRIAMV VPPWLSVPPP GYGGLEHVVA GLVDGLIARG HTVTLFGAGE 
RTDTAARFVS TDAELKFQRI GEALPELAHL VQVNQLVGPE QFDVVHDHTT IGPLLAGRRA
VPTVATVHGN PVGEYGTVLG DIDRGVGLVA ISHAQRRLNL RLPWVGTVHN ALDVDDIPHK
RTPSHGPVLW LARFSPDKGP DLAIRACRSA GLPLVLAGKC NEPDERRYYH DVVRPMLGDD
ITVVLDADRR DAFRLLLEAR CLVMPIQWEE PFGIVMLEAM ATGTPVVALR RGAVPELVVP
GRTGLICEHV DELPGALRAA SRLDPGVCVA HVVENFSTAR LVDGYETVFQ RFVSAVVPAR
EPAPITFR