Gene Rcas_1941 details

Gene Information

Locus tag: Rcas_1941
Symbol:
ID: 5539419
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 2484902
End bp: 2487766
Gene Length: 2865 bp
Protein Length: 954 aa
Translation table: 11
GC content: 63%
IMG OID: 640894077
Product: hypothetical protein
Protein accession: YP_001432048
Protein GI: 156741919
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family
TIGRFAM ID:
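
The coordinate and length fields in this record can be cross-checked with a little arithmetic. The sketch below is illustrative only. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2484902
end_bp = 2487766

# Assumption: coordinates are 1-based and inclusive at both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not part of the protein.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 2865 0 954
```

Under these assumptions the computed values agree with the listed gene length (2865 bp) and protein length (954 aa).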


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00000218078
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
GTGAGGAGTA AGACGATTTC AGGCGATCCG CCTGCACTAC AGCCAACCGT CAGCGCGTAT 
GCGCGCCTGA CGCTGCGTTA TCGTGATGCG CGTCTGGCGC TGGTTGTTGT GCTGGCGCTT
CTGGTGGGCA TCTTTGCGTA TCAGGCGCCG GTTGATGCGA CGATCTTCGT CGGTTGGCCC
GGCGACCGCC TGTTTTTGCA GGCAAGCGAG GGCGCTGGCG CTGCCGACCG GTTCACCTTC
TACGGCGATG AAATCACCGC TGACGCGCCG GGGGGGCGCA GCCGCTGGAC GCACCAGGGA
GCGCGGATTG ATCTGGCAGG TCTGGGCAAG GGTGCGCTTG TGGTGACGGT GCGCGCGCAG
GGGTGGCCCC CCGATGCCCT CAACCGCGCG ACGCGCCAGC CGGAGGTGAC GGTCATAGCG
GACGATACGT TCATTGGACG CTTCACGCCA GGCGAACAGT GGGCAGAGTA CGATGTCGCT
GTTCCCGCCG AGGTGCGGCG CAGCAATGAC CTGACGCTGA TGCTGGTTGC GTCGGATGTC
TTCACCAGCA CGAGCATCTA CAACGATCCT CGTCCAAAAG GAATCCGCAT CGACGCTATT
CGGGTGCGCA GCGTCGGTGA TGGTCCATTC GTGCCTGCGC TGGCGCCGGT GCTCTGGTTG
ACGGTGGGTG GAGGGGTCTG GTTTCTGGCG CTGGCAGGGG GCACTCGCAA GCCAACGTTT
GCCTTCGTTG CTGCGACGCT GCTCGTCAGC GGTGCAGCGC TTGCGCTGGC GGCGGCGCGC
ATCTGGACGG TTGTACTGTT GCCATGGCTC GCCGGGGCCG GCATTGCGCT GCTGATCTGG
CAGCATCGCG TCGCTCTGGT GCGCCATCCG CTGCGCCTGG CGCGCCGCTT CACGCGCGGG
TCGGCGCTCG GCTATGGGCT GCTGGCGGCA ACAGGCGCGT GGCTGGTGGC GATGATCATT
CAGTATGCCC CGCCGATGCC GCGCCTGGAA CAGTTCTGGG AGTTTTTTCC CGACTCGCTC
ATCTATACGC TGATCACGCT GGGCAGTCTG GCGCTGATCC TGGTCTACGG GCGCAATGGA
GTGCCCCGCC TGGCGCAGGC GACGGTGCGC ATGGCAGGTT CGCGGCGCGG TGCGACGCTG
ACGCTGACGC TCCTGCTGGC AGTGTGGGTT GGCTATCTGG GGGCGGTCAT TCTGGCGATG
CCATATGTCG GGCATGCCGA CTATGCCGAT AATGCGGTCG TGGCGCGCAA TCTCCTTGCC
GGACGAGGAT GGACGGTCGA TTATGTGACG CAGTTCTACC GCCTCTACGA CGGTGTCACG
CGCCCGCAGG AAACATGGCC CCTGCTGCAA CCGGTGTGGG TGGCGTCTTC GTTTGCGCTT
TTTGGCGTCA GTAATTGGGC TGCGAAGATT CCGAACCTGC TGTTTCTCAC TGTGCTGGCG
CTGGCGGTCT ATCAGGCTGG CGCGCGCCTG TGGGACCGGC GCGTCGGATT GGTTGCAGTC
GTGTTGCTCA TCACCAGTCA TCTCTACTTC AAACTGGTAA TCTATGTGAC GAATGACCTG
GGGTTTGCGC TGTTTGCGTT CGGCGCACTG CTGCTGCTGT ATCGCGCCTG GGGTGCGCCG
GATGCGATGG CGCTCGCGTC GCGTTTGCGC TTTGGGAGCA TATCGCAGCA ATTGGCGTTG
ACGATCCTCT CTGGTGCGTT AACCGGTCTG ATGCTCTTGC AAAAGCCGGG AAGCGGCGGG
TTGATCGCGC TGGGCATGGG ATGGTGGTTC CTGCGGGTGC GCTTTGACAC CTGGCCGCGA
ACCCTGGCCG ATCTGCGCAC GCGACTGGCG CCCGTAGCGG CGTGGGGTGT GGTTGCGTTT
GTGTTGCTGG CGCCGTATCT GGCGCGGAAT ATGCTCACCT TCGGCGTGCC GTTCTATTCG
ACCGAGGGCA AGGATGCGTG GGTGCTGGAG TACACGACAT GGGATCAGAT CTATGCCGTG
TATACCACCG AGGAAGGGTT CAGTACGCTC GGCGTTCCCG ACCGCAGCTG GATTCTGCGG
TGGGGATTCG ACCGCACCCT GGTCAAAATG GAGCGACAGG TGCGCGCACT GCGCGACTAT
CTGCTACCAT CCTGGCAGCA TGCCCCTTTC GGATTGAGCG AGTGGTTCGG GCGCGCCGAC
AAGGATCGCC TGCTGTTTGA TCCAGGCGCG TGGCTGGCGC TCTTCGGCGC TATGGCTGTC
ATAGCATCAC ATCGCCGCCT GATGACCCTG CTCGGCGCAG CGTTTGTGCC ATACACGCTG
TTTCTGATCG TCTACTGGCA CACGAATGAA GAACGTTACT GGGTTGTGGT GATGCCGTGG
CTTGCGCTTT TCGCTTCGGC AACGCTGTGG TGCATCTATG ACCGGATCGC TGCGCTGTCC
GATGGTCGTT GGACGCCATT CGGGTTGCTG GCAGTCGTTG CACTGATAGC GGCGATCATC
GCGCCGTCCT GGTCTCCAAT CGCCGAAAAG GTGCGCGACG AACCGAACCT GTATCGCGCC
GATCTCGACG CCTATGACTG GCTGAAGCGC AACACGCCGT CTGATGCCGT AGTGATGACG
CGCAACCCGT GGCAACTCAA CTGGCACAGC GAACGTCCGG CGCTCATGAT ACCGTACACG
ACCGACCGGG AGACCTTCCT GCGCCTGGCG CGCCGCTACA ATGTGCGCTA TCTGATGATC
GATACCTTGC AGCGCCCTGA GCCGGAAGTG CGCCGCCTGC TCGATGCCAT GATCGCCGAT
CCGGCATTGG GGTTCCGCGA AGTCTACCGC ACACCAGTGT ATCGCGCCGA TTTTCGCGGC
GTAACGAAGG AGATGATCGC CGTTGTGTAC GCATTCCCGC AGTGA
 
Protein sequence
MRSKTISGDP PALQPTVSAY ARLTLRYRDA RLALVVVLAL LVGIFAYQAP VDATIFVGWP 
GDRLFLQASE GAGAADRFTF YGDEITADAP GGRSRWTHQG ARIDLAGLGK GALVVTVRAQ
GWPPDALNRA TRQPEVTVIA DDTFIGRFTP GEQWAEYDVA VPAEVRRSND LTLMLVASDV
FTSTSIYNDP RPKGIRIDAI RVRSVGDGPF VPALAPVLWL TVGGGVWFLA LAGGTRKPTF
AFVAATLLVS GAALALAAAR IWTVVLLPWL AGAGIALLIW QHRVALVRHP LRLARRFTRG
SALGYGLLAA TGAWLVAMII QYAPPMPRLE QFWEFFPDSL IYTLITLGSL ALILVYGRNG
VPRLAQATVR MAGSRRGATL TLTLLLAVWV GYLGAVILAM PYVGHADYAD NAVVARNLLA
GRGWTVDYVT QFYRLYDGVT RPQETWPLLQ PVWVASSFAL FGVSNWAAKI PNLLFLTVLA
LAVYQAGARL WDRRVGLVAV VLLITSHLYF KLVIYVTNDL GFALFAFGAL LLLYRAWGAP
DAMALASRLR FGSISQQLAL TILSGALTGL MLLQKPGSGG LIALGMGWWF LRVRFDTWPR
TLADLRTRLA PVAAWGVVAF VLLAPYLARN MLTFGVPFYS TEGKDAWVLE YTTWDQIYAV
YTTEEGFSTL GVPDRSWILR WGFDRTLVKM ERQVRALRDY LLPSWQHAPF GLSEWFGRAD
KDRLLFDPGA WLALFGAMAV IASHRRLMTL LGAAFVPYTL FLIVYWHTNE ERYWVVVMPW
LALFASATLW CIYDRIAALS DGRWTPFGLL AVVALIAAII APSWSPIAEK VRDEPNLYRA
DLDAYDWLKR NTPSDAVVMT RNPWQLNWHS ERPALMIPYT TDRETFLRLA RRYNVRYLMI
DTLQRPEPEV RRLLDAMIAD PALGFREVYR TPVYRADFRG VTKEMIAVVY AFPQ
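
The gene and protein sequences above are printed in space-separated blocks of ten characters. The following minimal sketch shows one way to join such blocks and check that the start of the gene sequence translates to the start of the protein sequence. It is an illustration under stated assumptions, not the database's own method. The helper names `join_blocks` and `translate_cds` are hypothetical. The codon table is built from the standard amino acid assignments, which translation table 11 shares. The start codon handling is simplified: only ATG, GTG, and TTG are treated as initiators and read as methionine, although table 11 lists further alternatives.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

# Simplifying assumption: only these initiator codons are recognised.
START_CODONS = {"ATG", "GTG", "TTG"}


def join_blocks(text):
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def translate_cds(dna):
    """Translate a coding sequence, stopping at the first stop codon."""
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    residues = []
    for pos in range(0, usable, 3):
        codon = dna[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            residues.append("M")  # initiator codon is read as methionine
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First line of the gene sequence and first 20 residues of the protein above.
first_dna_line = "GTGAGGAGTA AGACGATTTC AGGCGATCCG CCTGCACTAC AGCCAACCGT CAGCGCGTAT"
first_protein_blocks = "MRSKTISGDP PALQPTVSAY"

translated = translate_cds(join_blocks(first_dna_line))
print(translated)  # MRSKTISGDPPALQPTVSAY
print(translated == join_blocks(first_protein_blocks))  # True
```

The same two helpers can be applied to the full gene sequence by pasting all of its lines into one string. Only the first line is used here to keep the example short.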