Gene Rcas_4131 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_4131 
Symbol 
ID5541642 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp5346011 
End bp5348044 
Gene Length2034 bp 
Protein Length677 aa 
Translation table11 
GC content59% 
IMG OID640896243 
Productglycosyl transferase family protein 
Protein accessionYP_001434181 
Protein GI156744052 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00103927 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000000000820148 
Fosmid HitchhikerNo 
Fosmid clonabilityunclonable 
 

Sequence

Gene sequence
ATGAAACGCA CACTTGTCGC GCTGTTCCTT ATCGCACTGG CGCCGCGCAT TTGGGCGCTC 
AATTGGGGAT TGCCCTATGT TGAACATCCC GACGAGCCCG CCCTGGTCGA AACAGTGGTG
CGTATGGTGC AGGAGGGCGA CTGGAATCCG CGACGATTCG TGTATCCATC GCTCTCTTTC
TACTTGCTGG CGGGCGTCGT CTTTCTCCAT GCGCAATGGG GCATAGCAAC CGGCATCTAT
GCATCGATCG CCGATCTGCC GCTCAAGACA TATCTCTTCA CTCTGGCGCC AGATCTGTAC
ATCTGGTTGC GCGCGCTGAT CGCGATCCTG GGGGCAGCGA CCGTTCCCCT CATCTACATA
CTGGCGCGGC GAATGTTCGA TGCACCTTCA GCATACCTCG CAGCGCTGAC ACTCAGTGTT
GCGGAATACC ACGTGCAACA TGCGCACTTT ATCACCACCG ATGCGCCAAC CGGTCTATGG
ACGACGTTGG CGCTGCTGGG CATATGGAAT GTCGCCGAGC GCGGCAGATT GCGCGATTAT
GTGTTGACCG GGATTGCGAC CGGTCTTGCT GCCGGAACGA AATATCAGGC AGGGGTTGTC
GGTCTGGCGC TAGGAGCGGC TGTTGCCGCG CGCCTGATCG ATTCGCGCAC CACAGGCGAA
CTGACACGCT CCGAAGTGAC AGCGCACATC AGAGGCATCG CAGTTGCGGC CGGTCTCGCG
CTCATCGTCT TTGCGCTGAC CACGCCATTC GCAATCCTCG ATATGTCATC GTTCCGCCGG
AGCATTGCCA GCACAATGAC CCAGTACGCC ACGAACGAAG GACAGGGCGA CTTCAGCGGC
GCCTGGCGCC TGGATGGATA TGCACGATTT TTCTGGGAGG ACGGCTTGTT GCCGTCGGGC
GTCCTGCTGA TGGCGGCAGG TTTGCCGTTT CTGGCGCGCT GTGCGCCACG GCAGACGATG
ATTCTGATTG CCGCCATCCT GGTTGGGCTT GCGCCTCTGG CGCCGCAAAC GGTCCATTTT
ATGCGCAACA CGCTGCCGGT CTTTCCGCTG CTTATTCTTC TGGCTTCCGG TGCAACGATC
AGTCTGGGCA GAGCCATCGG GCGATTGCAG TGCCTGAACT CGCAGCCAAT GAAACCGCAA
ACGGTGTCAA ACCGTGTGCT CATGACGCTT CCCATTATCC TGGGAGCGAG CGCGCTGATT
GCGCCGCAGA TTCAAGAAAC CACCTGGCGC CTCTCCTACT GGAGCAGACC GTATACGCTG
GTTCAGGCAG CAGACGTAAT CCGCGCTGAA CCACGAGGAA TGCTCGCGGC TGTTGAAGCA
AATCCCGTTC AGTGGGCGAA CGATCCGGTT GTGCATCCCG TCGACAGCGT CAGCGACCAT
CCTCCAGAGT GGTACTTGTC GCGCGGCTAT CGCTATCTTC TGCTGAACGA GGATCGGCGC
CGCAATCAGG AGAATTACGC CCGTCTGCTG GAAAGCGGCA TGCCATTGTT GGTCATGCCG
CCGCGCGATC TTGGATTGCA GCCGGGTCCT GGCGGAATTG TGCTGGATAT GGGGGAGCGT
ATTGACCTGA TACCGTTCAC GCGGCGCTCT GCACGTTTTG GCGACAGCAT TGATCTGCTT
GGATACGAAC TGCAACCCGG CGATCTCCGG TCGCGCATTA CCCCGCTGGA AGGCGCAAAC
CTTCGTATCT TTGCGCCCGG TCAGTCATTG CAACTTAATC TCTACTGGCG CGCGCTTACT
CGCATGGATC GCAATCTTGT TCTCTTCATC CATATCAACA ATCAGTACGA TCAGCGCGTT
GCTCAACGCG ACCTGCCCCT CCGTCTCGAC GACTATCCAA CGAGTCGCTG GCGCGTGGGC
GAACTCGTCA TTGATCGCGG CGACATGCCA TTGCCACCAC TTCCAGAAGG CGAGTACCGC
CTGCTGATCG GTCTCTATGA CGCTGAAACG GGAGTGCGAT TGCCGGTACG GGATCAAACC
GCAGTGGAAC TGACCACGAT CCGCGTGATC CACACTGCAC CTTCCTCCAA TTGA
 
Protein sequence
MKRTLVALFL IALAPRIWAL NWGLPYVEHP DEPALVETVV RMVQEGDWNP RRFVYPSLSF 
YLLAGVVFLH AQWGIATGIY ASIADLPLKT YLFTLAPDLY IWLRALIAIL GAATVPLIYI
LARRMFDAPS AYLAALTLSV AEYHVQHAHF ITTDAPTGLW TTLALLGIWN VAERGRLRDY
VLTGIATGLA AGTKYQAGVV GLALGAAVAA RLIDSRTTGE LTRSEVTAHI RGIAVAAGLA
LIVFALTTPF AILDMSSFRR SIASTMTQYA TNEGQGDFSG AWRLDGYARF FWEDGLLPSG
VLLMAAGLPF LARCAPRQTM ILIAAILVGL APLAPQTVHF MRNTLPVFPL LILLASGATI
SLGRAIGRLQ CLNSQPMKPQ TVSNRVLMTL PIILGASALI APQIQETTWR LSYWSRPYTL
VQAADVIRAE PRGMLAAVEA NPVQWANDPV VHPVDSVSDH PPEWYLSRGY RYLLLNEDRR
RNQENYARLL ESGMPLLVMP PRDLGLQPGP GGIVLDMGER IDLIPFTRRS ARFGDSIDLL
GYELQPGDLR SRITPLEGAN LRIFAPGQSL QLNLYWRALT RMDRNLVLFI HINNQYDQRV
AQRDLPLRLD DYPTSRWRVG ELVIDRGDMP LPPLPEGEYR LLIGLYDAET GVRLPVRDQT
AVELTTIRVI HTAPSSN