Gene RoseRS_4229 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_4229 
Symbol 
ID5211214 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp5298488 
End bp5299825 
Gene Length1338 bp 
Protein Length445 aa 
Translation table11 
GC content53% 
IMG OID640597818 
Productglycosyl transferase, group 1 
Protein accessionYP_001278522 
Protein GI148658317 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0378432 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAATCG TTCTGCCAGT ACACCACTTT TTGCCGAAAC ACGTTGGGGG GGCTGAACTC 
TATACCTTGA GGCTGTCCAA CCTGTTTCGT GCTCGGGGGC ATAGCGTCGA AATCGTATGC
GTCGAGTCGC TAGATGGAGA TCACGCTATC CAGGTCGAGG CGGAAAGAGA TTTCTACCAT
GATGTTCCCG TGTGGCGCCT CAGGTTGCCG CGTGACTGCG TCTCCGAGGG CTTGGGCATG
CTCTATGACT ATGCTCCGCT TGGAGCGTGG TTTGCCGACT ATGTGCAGCG GGAGCATCCC
GATGTTGTTC ACTTTCAGGC GGGCTATCTG ATCGGAGTAG CGCCACTCCG CGCTGCTGTC
AGCGCGGGTA TACCAACAGT GCTCACACTC CATGACCACT GGTTTCTTTG CCCGCGTATT
ATGCTACAAC GTGGTGATGG CAGCATCTGC ACCGCGATCC CTGACGATCC AGCCGGTTGC
GCATGGTGTA TGCTCCTTGA AAAACGACGC TATCACATTG CCGATCACCT GACCGGTGGT
CTTGCGGGTC GTCTTGCGCA GTTGTTAATG CTTATTCCCC AAAGGGAGGC TATTGCAGAG
CGCCGTTCAA CTCTAATGGA GTCCCTGAGT TTACCAGATA TCGTGATTGC GCCATCAATG
TATCTTGCCA GTCGCTTTGC CGAATATTTC CAGGCCGGGC GAATGATCGT TTTACGCGGA
GGGATCGATC TTGCTCCTTT CCAGAAAGTG CTATCAGCGC AGCATGATGG TATACTGCGT
TTTGGATTTA TTGGCCTGGT TGCTCCTCAT AAAGGGGTGC ATCTGCTTAT TGAAGCGTTC
CGCCTGTTGA ATAACCGGGA AAGACCTGTT GAGTTACACA TATACGGTAA TACTGATGCG
TATCCTGCTT ATGTCAGAGA GTTACGCCAA AAAGCGCGAG GTGATGATCG CATTCACTTC
CACGGTCGTT TCGAGCCATC CCGCGTTGCA GAAGTGTTCG CTGGCTTCGA TGTCACCGTG
CTACCATCGC TCTGCTACGA GAACAATCCA CTTGTGATTT TGGAGTCTTA CGCTGCTGGA
AAACCGGTGA TAACTGCGGC GATGGCAGGG ATGAAGGAGT TGGTGAACCA TGCGGAGAAC
GGGCTACACT TCAAGGCGGC AGACGCCAGA GATCTGGCAC GACAGATGCA GTTGTTAATC
GATGATACTG ACCTTTTACC AAAGTTGCGC AGAGGCGTCC GACCGCCGCG GAGCATCGAT
CAAGAGGCAG AGGATCTGCT CGGTATCTAT GAACGTCTCT GTCAGCAACG CAACAGTCCG
TCCAGCAAGG TGGTGTAA
 
Protein sequence
MKIVLPVHHF LPKHVGGAEL YTLRLSNLFR ARGHSVEIVC VESLDGDHAI QVEAERDFYH 
DVPVWRLRLP RDCVSEGLGM LYDYAPLGAW FADYVQREHP DVVHFQAGYL IGVAPLRAAV
SAGIPTVLTL HDHWFLCPRI MLQRGDGSIC TAIPDDPAGC AWCMLLEKRR YHIADHLTGG
LAGRLAQLLM LIPQREAIAE RRSTLMESLS LPDIVIAPSM YLASRFAEYF QAGRMIVLRG
GIDLAPFQKV LSAQHDGILR FGFIGLVAPH KGVHLLIEAF RLLNNRERPV ELHIYGNTDA
YPAYVRELRQ KARGDDRIHF HGRFEPSRVA EVFAGFDVTV LPSLCYENNP LVILESYAAG
KPVITAAMAG MKELVNHAEN GLHFKAADAR DLARQMQLLI DDTDLLPKLR RGVRPPRSID
QEAEDLLGIY ERLCQQRNSP SSKVV