Gene Rcas_1150 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_1150 
Symbol 
ID5538616 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp1487835 
End bp1489880 
Gene Length2046 bp 
Protein Length681 aa 
Translation table11 
GC content60% 
IMG OID640893282 
Productglycosyl transferase family protein 
Protein accessionYP_001431265 
Protein GI156741136 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.620757 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTGCTT TCACGCGCGT TCTGAGCCGT CGTACAGTGT CAGCAGTTCC GGTCATCCTC 
TTTCTTCTGG CAATTATTGT GCGCTTACCC AACGGCATCT TTCTCACGGT CGATGAAGCC
TACCACTGGA TCGCGCTCTC GAAACGTTTC GCCACCACGC TCTCTACCGG CAATTTTGCC
GATACCTTCT ACTTCGGGCA TCCGGCTGTT ACGACTCTCT GGCTTGGCGT GGCGGGCCAT
TGGCTCTACG ATACCCTGAC TGTGCCCGAA ATCATTGGTG AGGGCAAGGA TACCTTCTAT
CCCCTGGTGC GCCTGCCGGC AGCGTTGGTC ACGGCGCTTG CGCTTGCACT TTCGTATCCG
CTGCTCAAAC GCTTATTTGG CTTCCATGTT GCGCTCCTTG CGGCGCTTCT CTGGGTGGGT
GAGCCGTTTC TGGTCGCTCA CTCGCAGTTG TTTCACATGG ACGCCACGCT CACCTCGTTG
ATGACGCTGA GTCTGCTTTT ACTGTTGATT GCTCTCCAGC ATCGCGAGGG CAGTCTGCTC
GGTTCGCCCT GGTGGATCGG CAGCGGTGTG GCGTTCGGTC TGGGGCTGCT CACAAAGTCG
CCTGCGCTCT TGCTCATACC GATGGCGGGG CTGATTGCAC TGTTTCGGCA GTGGGACGCC
ATCCGCGATA TTCGCTCATT TGTGAAAGCG CTCATCGGTT GCGTTGCGCC GCTGTGCGTG
TGGGGCGGGA TTGCGGCAAT TGTCTGGACG GTGCTCTGGC CCGCGATGTG GGTCAAACCG
CTGGCTACAA CGTGGGGTGT GGTGTCGGAG ATCCTCTTCG ATGGCGGCGC GCCGCACCCC
TGGGGCAATT TCTTTATGGG ACGCGCTGTT GACGATCCAG GACCGCTCTT CTATCTTGTG
GCGATCCCTT TCCGACTGGC GCCCTGGACG TTGATCGGCG TTCTGTTGTG GGTCGGATTC
ACCGTGACGG ATGGCAGGCG CGCATGGAGT GGATCGAACC GCTGGTTGCT ACTGCTGGCG
CTCTTCGTTG TCCTTTTTGT TGCGGCGATC TCGGTGATGG CAAAAAAATT CGACCGCTAT
GCGCTACCGG TCTTTCCTGC GCTGACAATC CTGGCGGCAG CCGGGATCTG CCGCAGCGCC
GGTTTCTTGT GGCGCGGTAT CACAAGGTTT GTCGTCGTTC CGCCGCAGCG GTTCATCGTT
GCGGGGTATG CGTGTGCCAT CATCACGCTG GCGATTAACC TGTACGCCTA TTTCCCGTAC
TATCTGGCAT ATTACAGCCC GCTGCTGGGA GGAGGAGCGG CGGCAGAGCA GATTCTGCCC
GTTGGTTGGG GGGAAGGGCT TGAACTGGCT GCGGCATTTA TCGCCGCACA ACCCGACGGC
AAGGATCGTC CGATTGCCGT GTTCTATCAA CCGGTGCTAC GCCCATTTGC GCCGGCGGGT
GTCGCACCGC TCCAGGCGAT CCAGGACCCG CGTCGGGTCG ATTATGCGAT TGCGTACATC
GATCAGTTGC AACGCAACAC CCAACCCGAA CTTCACCAAC CATTTCGCCG TCTGCAACCG
TTGTACACCG TGCGCATCCA TGGTATCAAC TACGCCTACG TCTACCAGGT TCCGCCGCCG
GTCGCGCAGC CACTCAAGGC CGATTTCGGC GAAGCCATTC ATCTGCGCGG CTACGATCTG
GATGCTTCGG CGATCCGATC CGGCGGCGCG CTGACCATTA CTCTGGAGTG GCAGGCGCGC
GCGCCGGTGG ACAATGACTA TGTTCTCTTT ATTCACGTGT TGAATGATCG TGCTGAGAGG
GTAGCACAGA TCGACGTGCC GCTCGGAACG GATCACTGGA CCTCGCGGAC ATGGCGAGCG
GGACGATGCT TTTCAACCCT GTATCGTGTT CCGCTTCCGT TCGATCTGCC GGCCGGGACG
TATCGCCTGG CGATGGGTGT GTACGACCCG CACACCTTTG CCAGATTGTC GCTGCGCACG
GACGGGGAAC GCGCTACGGA TGCCGGCGAA CACGCGCTGC TCTTGACGCG CATCACGATC
CCTTGA
 
Protein sequence
MAAFTRVLSR RTVSAVPVIL FLLAIIVRLP NGIFLTVDEA YHWIALSKRF ATTLSTGNFA 
DTFYFGHPAV TTLWLGVAGH WLYDTLTVPE IIGEGKDTFY PLVRLPAALV TALALALSYP
LLKRLFGFHV ALLAALLWVG EPFLVAHSQL FHMDATLTSL MTLSLLLLLI ALQHREGSLL
GSPWWIGSGV AFGLGLLTKS PALLLIPMAG LIALFRQWDA IRDIRSFVKA LIGCVAPLCV
WGGIAAIVWT VLWPAMWVKP LATTWGVVSE ILFDGGAPHP WGNFFMGRAV DDPGPLFYLV
AIPFRLAPWT LIGVLLWVGF TVTDGRRAWS GSNRWLLLLA LFVVLFVAAI SVMAKKFDRY
ALPVFPALTI LAAAGICRSA GFLWRGITRF VVVPPQRFIV AGYACAIITL AINLYAYFPY
YLAYYSPLLG GGAAAEQILP VGWGEGLELA AAFIAAQPDG KDRPIAVFYQ PVLRPFAPAG
VAPLQAIQDP RRVDYAIAYI DQLQRNTQPE LHQPFRRLQP LYTVRIHGIN YAYVYQVPPP
VAQPLKADFG EAIHLRGYDL DASAIRSGGA LTITLEWQAR APVDNDYVLF IHVLNDRAER
VAQIDVPLGT DHWTSRTWRA GRCFSTLYRV PLPFDLPAGT YRLAMGVYDP HTFARLSLRT
DGERATDAGE HALLLTRITI P