Gene RoseRS_4433 details

Gene Information

Locus tag: RoseRS_4433
Symbol:
ID: 5211418
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus sp. RS-1
Kingdom: Bacteria
Replicon accession: NC_009523
Strand:
Start bp: 5555222
End bp: 5556802
Gene Length: 1581 bp
Protein Length: 526 aa
Translation table: 11
GC content: 65%
IMG OID: 640598012
Product: hypothetical protein
Protein accession: YP_001278715
Protein GI: 148658510
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family
TIGRFAM ID:
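
The start, end, gene length, and protein length listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the usual GenBank convention, which this record does not state explicitly) and that the CDS ends in a single stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS: number of codons minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length(5555222, 5556802)
print(length, protein_length(length))  # 1581 526
```

Both results agree with the Gene Length and Protein Length fields of the record.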


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.576775
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACCACA TCACCTCTGC CGGTCGAACC GATGCTGTTG GGCGCCCGTT GTCCACTGTT 
CGCGCAGCGT TGCTGATCGC TGCCGCAACA TTGCTCATCG GTCTCTGGAA CCTGGAGGGT
CCTGAGTTCT GGTGGGACGA GGGATGGACG CTTTCGGTGG CGCGCAATGT CGTTGAGCGC
GGGCACTATG GTCGCTTGCT CGACGGTCAA CCTGCGCCCG GCGGGCTGGA AGCGTCGCCG
GTGGTAACGT TGCCGGTCGC GTTGAGTTTT CAGGTCTTTG GCGTGGGTCT CTGGCAGGGG
CGACTCGTCA GCCTGGCAGC GGCGGCTGCG GCGCTGGCGC TGATGTTCGT TCTGGCTGCG
CGGCTCTACG ACCGGCGGGT GGCGTGGGGA ACGCCCGGCG CGCTCCTGTT GCTCACGGCG
CATCCGCAGT TGAATGCGCT TCTGACCGGG AGGCAGGCGC TGGGTGAGAT GCCGATGCTC
CTTTTCTTGA TGGGCGGGTA TCTCTGTCTG GACGCGGCAG TGCGCGGACG AGCGTTCTGG
ATCGCGCCGG CGGCGCTGCT CTGGGCGCTT GCGTCGCTTG CCAAAGCGCA AACATTGCCG
TTCTGGGTCG TCTCGATGGC GGGCGCGACA GGCGTGGCGC TGCTGATGCA TCGCTGGCGT
GCAGCCGCAC TGGTGGCGGG AGGGGCGCTG CTGGCGTACC TGGCGCGTCC ATGGGTGTTG
CAGATTGTAA TGCTGCCGGT TACGGGGCGC ACCCTGCCCG GCACGCCGGT CAGCGGCATC
TATGACGTGA CCGCGTTCGT GCCCAATCTA TCGAACCGGT TGTTCGCCCT CCAGATGATC
CTGATCGGCG GGATTCCGAC GCTGATCGGG TTGGGGTACG CCGCATGGCG TCTCCTGAAG
GATGTGCGAG CAACCAATCC GGCAAGCGAT GATGATCGTG GCGCGCGGAT CGTACTGCGC
GCTGCACTGC TCGCCCTGGC AGGCAGCTGG TTCGCCTGGT TTGCCCTGCT ATCGGTCGGC
GTGCCGCGGT ACCTCTTCCC GGCGACGTTC GTTGCGGGCA TGTTTACTGC GGCGCTGGTG
CACGACCTGA CGAATGGATT TCGCCCGGCA TTCGTGGTGG AGGGGCTGGT TGCCCCGCTC
AGGACGCGAC GATTGACGCA GTGGAGCGCG GGGGCATGGC TGGCGATGTT GCTGGCGGCG
ACGACAGCGC CGCTGACGAT GCTGTCGTAC TGGCAGCATT TCACCGCCGA TGAGCGCGCT
GCCGTTCGGG TGGCTGCATT CCTGAACACG CAGACGCCGC CTGACGCCCT GATCGAAACC
TACGAAAGCG AATTGCACTT CCTGCTCGAC CGCCGGTACC ACTATCCGCC GGATCAGACG
CACGTCGAGT TGAACCGGCG CAGTCTGCTG GGGCAGGACA CGCCGGTTGC GTATGAACCG
CCGGCGGTCT ACCCTTCCTA CCTGGTGGTG GGGCGGTTTG CGGCGGGCAA CCGCCTGTAT
GACACGGCGC TCGCGTCGGG CGCTTTCCGC GAAGTCATGC AGGACGGGCG GTACACCGTG
TACGAACGGG TAAGGAAATA A
 
Protein sequence
MNHITSAGRT DAVGRPLSTV RAALLIAAAT LLIGLWNLEG PEFWWDEGWT LSVARNVVER 
GHYGRLLDGQ PAPGGLEASP VVTLPVALSF QVFGVGLWQG RLVSLAAAAA ALALMFVLAA
RLYDRRVAWG TPGALLLLTA HPQLNALLTG RQALGEMPML LFLMGGYLCL DAAVRGRAFW
IAPAALLWAL ASLAKAQTLP FWVVSMAGAT GVALLMHRWR AAALVAGGAL LAYLARPWVL
QIVMLPVTGR TLPGTPVSGI YDVTAFVPNL SNRLFALQMI LIGGIPTLIG LGYAAWRLLK
DVRATNPASD DDRGARIVLR AALLALAGSW FAWFALLSVG VPRYLFPATF VAGMFTAALV
HDLTNGFRPA FVVEGLVAPL RTRRLTQWSA GAWLAMLLAA TTAPLTMLSY WQHFTADERA
AVRVAAFLNT QTPPDALIET YESELHFLLD RRYHYPPDQT HVELNRRSLL GQDTPVAYEP
PAVYPSYLVV GRFAAGNRLY DTALASGAFR EVMQDGRYTV YERVRK