Gene RoseRS_4221 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_4221 
Symbol 
ID5211206 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp5284319 
End bp5285824 
Gene Length1506 bp 
Protein Length501 aa 
Translation table11 
GC content55% 
IMG OID640597810 
Producthypothetical protein 
Protein accessionYP_001278514 
Protein GI148658309 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGCTTC GTCTTCACAT CTCATCGTAT CGTTTCATCG CGCTGGTTGT TAGCGTACTG 
GCGATTGGAT ATCTTCTGTA CGCTCATCAA CCCCTGTATC CTCCCCCCTG GTTTGATGAA
GGATTAAACG CAGGCACAGC CGCTACGCTG GCTCGTAGCG GCTTGTATGC ATTGCCAGAC
CCCGAAAAGC CCCGAGTTCT CGATCCAGCG ATCCAGACCG GTCCAACGGT CATTGTTCCG
ATTGCCCTTG CGTACCGCAT ATTCAGTCCA GGCGTCTGGC TGGCGCGGAT GGTTGTCCTT
CCATTTGCGG TGCTGGCGTG TGTTGGATTT ATTCTCATAG CGCGACGGCT CATCGGAGAT
GGCGGCGCCG GTCTGGCGTT CCTCTTTTTG CTGGCAGGCA CATATGATAT CTATGCCAGT
TTTGTGCCGA TGGCACGCCA GGCATTGGGC GAAGTTCCTG CGCTCGCCTA TCTGCTGATT
GGTCTTTCGA TCTGGTTCCG CTCGCTGGAA CGCAAAACGT ATGCTCCGGT CGCCTGGGTA
TTCAGCGGGA TTGCCTGGGG CATTGCCATG GTCACCAAAT CGCAGGTGCT CATTCTCGTG
CCTGTCGCAC TGGGGGTTAT CCTGGTGTTG GACAGACTGT ACTACCGTAA GGCCAGCTGG
CTGGCAGTGA TTGTTCCTGG CATCTGCGCT ATGGCATGTG TCGCTGGCTG GTATGCGGCT
CAGATCGCTC TTGTCGGTAG TGATCGATTC CAGGACAATG CTTCGGTGTT GCGTGACGGA
TTCTGGCTCC ATATCGCAGC GCTCGATCCA GAACGCTGGC GCAATGCCTT GAGCGTCCTC
TGGCGAACCG GATGGTGGCT GTGGGGAGTC CTCGCAATCG TATGGGGCGT CTATCAGGCG
CGCCGGCCAA CCTTTCACGG ATTTATCCAC GCCGTATTGC TGATCTTTCT GGGGGTCAAC
CTGGTATGGT TTGCTGCGCT GTCGATCGGG TGGGCGCGGT ACGCTTTTTA CTTTCTTGTT
CTGACGACGA TCCCGCTCGC AGGCGCTCTC ATCGCGCTCT GGAACTACGC TGCGATCCCC
TCTATCCCCC GGAAAGCAAT AGTCGTTCTT TTATGTGGTA CGTATGTGGT GTATCAGGGT
CTGCCAGACA AAGTCTATAA CGTCCTCCAT CCTTCCGACA ACGGCTACCG GCACATGGTG
ACAGTTTTAC AACGTATCGT TCCTGAAGAT GCAATCGTTG CATCCTGGGA ATGGGAGTTT
AGCATCGAGT CTGATCGGCG CATCACCTAT CCGTCTACTC ATGCGGCGAA TGTCTATACT
CGTCACCTTA TGCTTCGCCA ACGATTGCCT GACGATATGC GCGATGTCGT TCCTTCAGAT
CCGGATTATA TCCTCATAGG CAGTTTTGGC TCGTGGACAA AGATATACGA CAGATACATC
ACCTCTCGCA ATCTTCAACT GGTCGCCAGG CATGGCGTCT ACAGCCTCTA TCGGGTCGTG
AAATAG
 
Protein sequence
MTLRLHISSY RFIALVVSVL AIGYLLYAHQ PLYPPPWFDE GLNAGTAATL ARSGLYALPD 
PEKPRVLDPA IQTGPTVIVP IALAYRIFSP GVWLARMVVL PFAVLACVGF ILIARRLIGD
GGAGLAFLFL LAGTYDIYAS FVPMARQALG EVPALAYLLI GLSIWFRSLE RKTYAPVAWV
FSGIAWGIAM VTKSQVLILV PVALGVILVL DRLYYRKASW LAVIVPGICA MACVAGWYAA
QIALVGSDRF QDNASVLRDG FWLHIAALDP ERWRNALSVL WRTGWWLWGV LAIVWGVYQA
RRPTFHGFIH AVLLIFLGVN LVWFAALSIG WARYAFYFLV LTTIPLAGAL IALWNYAAIP
SIPRKAIVVL LCGTYVVYQG LPDKVYNVLH PSDNGYRHMV TVLQRIVPED AIVASWEWEF
SIESDRRITY PSTHAANVYT RHLMLRQRLP DDMRDVVPSD PDYILIGSFG SWTKIYDRYI
TSRNLQLVAR HGVYSLYRVV K