Gene Rcas_3887 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_3887 
Symbol 
ID5541393 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp5087102 
End bp5089372 
Gene Length2271 bp 
Protein Length756 aa 
Translation table11 
GC content62% 
IMG OID640895998 
Producthypothetical protein 
Protein accessionYP_001433941 
Protein GI156743812 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGGCTC AACAGACGCT GCGCAGTGCG CGGTCCGTGA CGACGATTGA TCTCATCTTG 
CTGGCTGCTA TTGTTGCGCT GGCGCTGGTC ACACGCCTCT GGTTCTGGCA GGTGCAGGCG
CGTTCCGGCG CCGTGCCGCC GGGCGATCCG GAAGAGTACT ATCGCGCTGC CATCCATATG
CTGCACGGCG GGTACCACGA TACCGGCAAA TGGCTGCGCC CGCCGGTCTA TCCGGCATTC
CTGGCGCTGC TCTTGCCACC GACGAGAATG AATGTCGCCG GAGCGCTGTT GCTTCAGGCG
TGTGTTTTAG GCATCGGAAC GCTGGTATTC TATGCCTCCG GCACGCAACT GTTCGGGCGC
GTCACAGGAA TGGTGACGAC ATTGCTCGCA GCACTGTTCG TGCCGCTGGC ATCGTATGCG
AGTTCGCTTT ATGCCGAAGC GCTGTTTGTG ACACTCCTGG TCATTGGTCT TGCGCTGATC
GACCGCGCGC TGGTTCGCAA CAGCACGCGA GCGGCATGTG GCGCCGGCGT GCTTCTGGCG
CTGGCGGCAC TGACCCGCGC CGTGGGTCTG TACCTGATCC CGCTAGCAGC CGTGTGGATT
GCCTGGCGCA TGCGACACGG CGGTAGCCTG TCGATAGGGA TGGGCTTGTC TGACACGCGC
CTGCGTCGCG TTGCCCGTTC CAAAGATCAC GGCGCCCACA TTCACCCTTC TTCCGATGTA
GGAGAAGGAG CGTGGAAGGA TACAGGGAAA GACGCCTCGC AGAGTGAACA GCACATTATG
AGTGACAAGA GCGACCGCCT GGAGATACGT TCCTATCAAC TGGCAATCTC CCTTATCCTG
GGAGCGCTTC TGGTGGTTGG ACCGTGGGCA GCGCGGAATT ATCTGGCGCA CGGGCGCGTC
ATCCTCAGCG ATACCAATGG CGGCATCAGT ATGTGGTACG GCACAGTGCG CGACGATGCC
GAAGAGAAGG CAGGCGAAGC GCGGCTGGCG GCTGTGCCCA ACCTCGCCGA CCGGCAATCG
CTTGCAATTC AGATGGCGTG GGAGAACATT CGTCATGATC CGGCACGGTT CCTGGCGCGC
ATGCGTTTCA AGATTGCGTC GCTCTACGCG CTGCAAACAC GCAGTTATGC CGTCGGCGAT
GTTATTTCAA TCGACTCGCG TGGTGCGCCA CTGGTTCAGA ATGCAGGCGA ATATCGCCTG
AGCATGACGC TTTTGGCGGA CGTGCAGTAC GTGGCGCTCA TAATCCTGGC AATTGGCGGC
GTCTGTTTTA TGCCGCACCC TGCCCGTGCC ATTCCGACGT TGCTCTGGGT GGGACTGGCG
ACCCTGCTGG CGGTATTGAC CATTGGACAC CCGCGATTGC GCCTTCCGAT TGTTGCGTCT
GTCCTGCCCT TCGCTGCGTA TGCGCTGGTC AGATTGCCCG CAGGATGGCG ACACATCCGT
CAATTGCCGC GCGACCGGCG CAGTTATATG GCGCTGAGCG GAGTGATGGT TTTCCTGGCG
CTGATCGTCA GCATGCGGTA TATTCCGTGG GGTGCGGGTA TGTGGTATGC TGTGCCGGGA
CGATCGGCAC TCGAAGCGGG CGATTTGCGA CAGGCTGAAA CGCTGCTGGC GCTGGCGCAC
GATGCTCACC CGGATAACCC GTTGCGCGTG ATCGATCTTG CCGATCTGCG GTTGGCGCAG
GGCGATGATC GGGCGGCGCT TAGCCTGTAC CGGCGCGCGG CTGAGATGGA ACGTCGCAGC
CTGTATGCGC AGGCGATGCG CGCCATCACC GGCGCGTATC TTGCCATGCC CGACGAAGCG
GCAGCAGGAT TGGCAGCGAT CGATGATTAC TGGCGCTCAG GCAACGATCT GCTCGAATGG
GCATGGACCA CACGGCGACG TCCTGCACCG GATCGCGTCG TTCCCGGCGA TCCGATGGCG
CTGGGACTGT ATGCCGGGTT TGCGCCCGCC ACGCCTGATC TCGCGGTTGG GCGCTGGACC
CTGGGAGAAG GACGGGTGCG GGTGCGTGGC GGCTGCGGCG CCTTAGCGGT TCAGTTGCGC
GGACCATCCG GGCGTCGGGT AGACATCAGC ATCGACGACT GGGGTATTCG AAAGCGGATG
ATAATGAACG GCGAACAACA GGAGGTGCGC CTTGCGCTCT CCGGCATTCG CGAATGTGAA
TTCGGACCCG AACTGACTGT GCATATCGTC AGCGAAACGG GACTGCTCGA TCTGGAGCGG
GCGCCATGGT ACACGGGCGT GGCAGTGTAC GAGGTGCGTG TCGAACGGTG A
 
Protein sequence
MQAQQTLRSA RSVTTIDLIL LAAIVALALV TRLWFWQVQA RSGAVPPGDP EEYYRAAIHM 
LHGGYHDTGK WLRPPVYPAF LALLLPPTRM NVAGALLLQA CVLGIGTLVF YASGTQLFGR
VTGMVTTLLA ALFVPLASYA SSLYAEALFV TLLVIGLALI DRALVRNSTR AACGAGVLLA
LAALTRAVGL YLIPLAAVWI AWRMRHGGSL SIGMGLSDTR LRRVARSKDH GAHIHPSSDV
GEGAWKDTGK DASQSEQHIM SDKSDRLEIR SYQLAISLIL GALLVVGPWA ARNYLAHGRV
ILSDTNGGIS MWYGTVRDDA EEKAGEARLA AVPNLADRQS LAIQMAWENI RHDPARFLAR
MRFKIASLYA LQTRSYAVGD VISIDSRGAP LVQNAGEYRL SMTLLADVQY VALIILAIGG
VCFMPHPARA IPTLLWVGLA TLLAVLTIGH PRLRLPIVAS VLPFAAYALV RLPAGWRHIR
QLPRDRRSYM ALSGVMVFLA LIVSMRYIPW GAGMWYAVPG RSALEAGDLR QAETLLALAH
DAHPDNPLRV IDLADLRLAQ GDDRAALSLY RRAAEMERRS LYAQAMRAIT GAYLAMPDEA
AAGLAAIDDY WRSGNDLLEW AWTTRRRPAP DRVVPGDPMA LGLYAGFAPA TPDLAVGRWT
LGEGRVRVRG GCGALAVQLR GPSGRRVDIS IDDWGIRKRM IMNGEQQEVR LALSGIRECE
FGPELTVHIV SETGLLDLER APWYTGVAVY EVRVER