Gene RoseRS_4438 details


Gene Information

Locus tag: RoseRS_4438
Symbol:
ID: 5211423
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus sp. RS-1
Kingdom: Bacteria
Replicon accession: NC_009523
Strand:
Start bp: 5561573
End bp: 5562685
Gene Length: 1113 bp
Protein Length: 370 aa
Translation table: 11
GC content: 57%
IMG OID: 640598017
Product: glycosyl transferase, group 1
Protein accession: YP_001278720
Protein GI: 148658515
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
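
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the record above.
# Assumption: coordinates are 1-based and inclusive.
start_bp = 5561573
end_bp = 5562685

gene_length = end_bp - start_bp + 1   # inclusive span in base pairs
codons = gene_length // 3             # complete codons in the CDS
protein_length = codons - 1           # assumption: the final codon is a stop

print(gene_length, gene_length % 3, protein_length)  # 1113 0 370
```

Under these assumptions the result agrees with the listed Gene Length (1113 bp) and Protein Length (370 aa).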


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.356751
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.769914
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAGTCC TTTTTTGCAT CGACCACCTC CGATCCGATG GGACGCAGCG CGTTCTCTGT 
CAACTGACGA TGGGTTTGAC CAGGCGCCGC CATCGAGTTA CCACGCTTTG TCTCAACGAC
TCTTTCGATG CGCCTATGCT CGATAGTCTG CGGGCAGCGG GCGCAGATGT AAAAATTGCA
GGCAAGGCAG GTCTGGCAGG CGGCTATGGC ATTGTCGATA TCGTGCACTG GATGCAGCGA
GAGCGCTTCG ATGCGGCAGT AACAATGCTC TTCTGGTCCG ATGTCATTGG GCGCATTACA
GCACGCCTGG CGCATATACC GCGTCTCATT TCATCAATTC GAGCGCGCAA CCGGCAGTAT
GCTCTCTGGC AGTTGCTTCT CGTGCGGGCG ACGATGCCTC TTGTGGATGC AGTCGTCCTG
AACAGTCGTC GGGTTGCTGC ATTCGCGGTC GCTGGAGAAG GCGCTCCTCC TGACCGGCTT
GTCCATATCC CGAACGGTGT CGATGTGTCG TCATACGAAC GTGCTCTACC ACGCACAGCG
CTTTGCGCCC GATTTGGCGT GCCGGAGGAT GCCATGATTA TTGGTAGCAT CGGGCGGTTG
ACATACCAAA AAGGGTTCGA TGTGCTTCTC GATGCACTGG CTCAACTTCC ACTCGTAAAT
GTGCATCTCA TCGTGGCAGG CGCAGGGGAA GAGCGGGAGC ATCTGCACAG GCAGGCTCGA
TGCCTGGGTA TCGATAGACG GGTTCATCTG GTTGGATATC GCAGAGATGT CCCCCAATGG
TTGGGGGCGC TCGATGTGTA TGTTCAACCA TCGCGCTTCG AGGGAGCGCC CAATGCGCTG
CTTGAAGCAA TGGCCGCAGG ATGCCCGATT GTTGCGACCG AGGTTGACGG CAACAGCGAA
CTGATTGCTG ATGGGATTCA TGGCTGGTTA GTGCAGGCAG ATCATGTCGG CTCTCTGGCG
GGCGCTCTGG GCGAAGCGTT GGCGAATCGA CCAGAAGCGC GGCGACGTGG TGCAGCTGCA
TATGAGCGTG CACGAACAGA GTTTAGCGTC GAGCGTATGG TGGAAAGGTG GGAACAGGTG
TTGACGAACA ATTGCTGTGC GCCGACCCCC TGA
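
The record lists a GC content of 57%. A figure of this kind can be recomputed from the sequence as printed, provided the spaces between the 10-base blocks are removed first. The helper below is a minimal sketch; the function name `gc_content` is hypothetical, and how the database rounds its value is not stated in the record.

```python
def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace."""
    bases = "".join(seq.split()).upper()   # drop spaces and line breaks
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(base in "GC" for base in bases)
    return 100.0 * gc / len(bases)

# First row of the gene sequence above, used only as a short demo input.
first_row = "ATGAAAGTCC TTTTTTGCAT CGACCACCTC CGATCCGATG GGACGCAGCG CGTTCTCTGT"
print(round(gc_content(first_row), 1))
```

Passing the full gene sequence instead of `first_row` gives the value for the whole gene.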
 
Protein sequence
MKVLFCIDHL RSDGTQRVLC QLTMGLTRRR HRVTTLCLND SFDAPMLDSL RAAGADVKIA 
GKAGLAGGYG IVDIVHWMQR ERFDAAVTML FWSDVIGRIT ARLAHIPRLI SSIRARNRQY
ALWQLLLVRA TMPLVDAVVL NSRRVAAFAV AGEGAPPDRL VHIPNGVDVS SYERALPRTA
LCARFGVPED AMIIGSIGRL TYQKGFDVLL DALAQLPLVN VHLIVAGAGE EREHLHRQAR
CLGIDRRVHL VGYRRDVPQW LGALDVYVQP SRFEGAPNAL LEAMAAGCPI VATEVDGNSE
LIADGIHGWL VQADHVGSLA GALGEALANR PEARRRGAAA YERARTEFSV ERMVERWEQV
LTNNCCAPTP
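
The protein sequence corresponds to the gene sequence translated with translation table 11. The sketch below builds a codon table from the compact TCAG-ordered amino acid string; table 11 assigns the same amino acids as the standard code and differs from it only in which codons may act as starts. The function name `translate` is hypothetical, and alternative start codons are not handled because this gene begins with ATG.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order ("*" marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(seq: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()   # drop spaces and line breaks
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 30 bases and the last row of the gene sequence above.
print(translate("ATGAAAGTCC TTTTTTGCAT CGACCACCTC"))      # MKVLFCIDHL
print(translate("TTGACGAACA ATTGCTGTGC GCCGACCCCC TGA"))  # LTNNCCAPTP
```

The two outputs match the first and last ten residues of the protein sequence listed above.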