Gene Rcas_2968 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_2968 
Symbol 
ID5540459 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp3850702 
End bp3852279 
Gene Length1578 bp 
Protein Length525 aa 
Translation table11 
GC content60% 
IMG OID640895087 
Product4-alpha-glucanotransferase 
Protein accessionYP_001433045 
Protein GI156742916 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1640] 4-alpha-glucanotransferase 
TIGRFAM ID[TIGR00217] 4-alpha-glucanotransferase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.34566 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00000602964 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGTCTATCA AGACGGGAAC GATTGCGATA TTGAGTCGAA ACGGCCGTTT AGTTAACGAG 
TCGTCTATGA GACCTTTTCC ACGCCGCAGC GGGGTGATCC TGCATCCGAC ATCACTTCCT
GGACCATACG GCATCGGCGA CCTTGGCGAT GGGGCGTATC GCTTTGTCGA TTTTCTCGCC
GCTGCCGGGC AGACGTACTG GCAGGTCCTG CCGCTCAGTC CGACCGGATA CGCCGACTCG
CCCTACCAGG GAATTTCGGC GTTCGCCGGT AATCCGCTGA TGATCAACCC GGACCGCCTG
ATCGCCGATG GTCATCTGAC GTCTGCCGAT CTGGTCGATC GTCCGGCGTT CTCTGATGAT
CGGGTCGATT TTGGCGCTGT GATTGGTTGG AAATTCGCGC TGCTCAATCG CGCCTTTGCG
CGCTTTCAGG CGACGCCATC CACCAACCGC GCGCGGTTCG AGCAGTTTTG TGATGAGCAG
GCAGCCTGGC TCGACGAGGC GGCGCTGTTT ATGGCGCTGA AACAGGCGCA TGGCATGCGC
GCCTGGACGG AGTGGCCTCT GGCGCTTGCA GCGCGCGACC CGGACGCGCT GGCGCTGGCG
CGTGCTGAAC TTGCCGATGT GGTCGAAGCG CACAAATATT TCCAGTGGCT GTTCTTCACA
CAGTGGCAGG AATTGCGACA CTACGCCAAT CGTCGCGGCA TCCGCATTAT TGGCGATGTG
CCGATCTTCG TCGCGCTCGA CAGCGCCGAT GTCTGGGCGA ATCCACATCT GTTCTGCCTC
GATGCAAACC TGCGCCCAAC CGTCGTTTCT GGCGTACCAC CTGATTATTT CAGTGAGACA
GGGCAACTCT GGGGGCATCC GCTCTATCGG TGGGATGTCA TGGCTGCCGA TGGGTATCGC
TGGTGGATCG ACCGCTTTCG CGCTTCGTTT ACGCTTGTCG ATGTCGTGCG TATCGACCAC
TTTCGCGGTT TCTACAACTA CTGGGAAATT CCGGCAGGTG AAACGACTGC GATCAACGGT
CGCTGGGTCG ATGGTCCGCG CGCCGATCTG TTCATGGCGG TCACCGCAGC GCTCGGCGAG
GTGGCGATCA TCGCCGAAGA CCTTGGCGAT TTTACGCCTG AGTCGCGCGC CGGTCTCGAT
GCGCTCATGG CACAGTTTGG CTTTCCGGGA ATGCGCATTC TTCAATTTGC CTTCAACCGC
CGCGAAGGGG ACCGCTTCTT TCCGCACAAC TACCCGCGCG CCTGTGTCGT GTATACTGGA
ACCCACGACA ACGACACCCT TGCCGGATGG TTCACCAACA GTTCGACCGA CGCCGAACGA
CGTGATGCGC TGCGCTATCT CTGCGGCAGC GCCGACGATA TCGTCTGGGC ATTCATCCGC
ACCGCCTGGA TGTCGGTCGC CGACACAGCC ATGACAACGG TGCAGGACCT GCTCGGTCTG
GGGAGCGAGG CGCGGATGAA CCTGCCCGGC ACGCTTGGGT CGCATAACTG GACCTGGCGT
GTGCCATCGG GAGCGCTTGA TCGGCATCGT GCCAGACGCC TGCGCGATCT GACGGAGATT
TATCAGCGAT TGCCCTGA
 
Protein sequence
MSIKTGTIAI LSRNGRLVNE SSMRPFPRRS GVILHPTSLP GPYGIGDLGD GAYRFVDFLA 
AAGQTYWQVL PLSPTGYADS PYQGISAFAG NPLMINPDRL IADGHLTSAD LVDRPAFSDD
RVDFGAVIGW KFALLNRAFA RFQATPSTNR ARFEQFCDEQ AAWLDEAALF MALKQAHGMR
AWTEWPLALA ARDPDALALA RAELADVVEA HKYFQWLFFT QWQELRHYAN RRGIRIIGDV
PIFVALDSAD VWANPHLFCL DANLRPTVVS GVPPDYFSET GQLWGHPLYR WDVMAADGYR
WWIDRFRASF TLVDVVRIDH FRGFYNYWEI PAGETTAING RWVDGPRADL FMAVTAALGE
VAIIAEDLGD FTPESRAGLD ALMAQFGFPG MRILQFAFNR REGDRFFPHN YPRACVVYTG
THDNDTLAGW FTNSSTDAER RDALRYLCGS ADDIVWAFIR TAWMSVADTA MTTVQDLLGL
GSEARMNLPG TLGSHNWTWR VPSGALDRHR ARRLRDLTEI YQRLP