Gene Rcas_3915 details

Gene Information

Locus tag: Rcas_3915
Symbol:
ID: 5541421
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 5116383
End bp: 5118371
Gene Length: 1989 bp
Protein Length: 662 aa
Translation table: 11
GC content: 62%
IMG OID: 640896025
Product: alpha amylase catalytic region
Protein accession: YP_001433968
Protein GI: 156743839
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0366] Glycosidases
TIGRFAM ID:
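
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not state the convention) and that the gene length includes the stop codon. The function name `check_cds` is hypothetical and is used only for illustration.

```python
def check_cds(start_bp, end_bp, gene_length_bp, protein_length_aa):
    """Cross-check the coordinate span, gene length and protein length of a CDS."""
    # Assumption: coordinates are 1-based and inclusive.
    span = end_bp - start_bp + 1
    # An unspliced CDS should be a whole number of codons.
    codons, remainder = divmod(gene_length_bp, 3)
    return {
        "span_matches_length": span == gene_length_bp,
        "whole_codons": remainder == 0,
        # Assumption: the last codon is a stop codon and is not translated.
        "protein_matches": codons - 1 == protein_length_aa,
    }


# Values copied from the record above.
result = check_cds(5116383, 5118371, 1989, 662)
print(result)
```

With the values listed here, 5118371 - 5116383 + 1 = 1989 bp, and 1989 / 3 = 663 codons, which is 662 amino acids plus one stop codon, so all three checks should pass.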


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.0962846
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCTCAA GGGTCCTGGC GGTGATTTTA ATAATGAAGG ACATGCCCAT GAACTTCATC 
CGTGACACCC TGAGTCGGCC ACGCCCGCCA AGTGTCAGGC GCTCGGTGCA ACTGCCGCGT
CGCGTGACGT ACTACCCCTC TCCGGTCGAC TGGCGCGATG AGGTGATCTA TTTTCTCCTC
GTTGATCGAT TCAGCGATGG TCAGGAAGAG ACCCGCCCGT TGCTCGACCG GCGCTATCTG
GCGGCAGCGC GCCCGACACT GCCCAATGGT GAGCCGTGGC GCTGGGACCG CTGGGCAGTG
TCGGGCGGTG AACGGTTTCA AGGCGGAACG CTGCGCGGCG TGGTCTCAAA ACTCGGCTAC
CTGCACCGGC TCGGCGTCAC CACTCTGTGG CTCAGCCCGG TCTGCAAGCA ACGCGGTCAC
CTGAACACCT ACCACGGCTA CGCAATCCAG GATTTTCTCG ATGTTGATCC GCGGTTTGGC
ACGCGCCAGG ACCTCGTCGA TCTCGTCAGC GCCGCGCATG AACAGGGAAT GCGCGTGCTA
CTCGACATTG TGTTCCAGCA CTCCGGTCCC AACTGGCGCT ACCCGCCGGA TGTTCCTGGC
GGCGCGGAGA TGCCGCGCTA CACGACTGGA CGTTATCCGT TCGGCAGTTG GTTGGACGCG
ACCGGCGCGC CGCTCCTGGG CGTTCCCGAT GTCGATGATG CTGCCTGGCC CGATGAGATG
CGCAATGTCG GGTACTACAC GCGGGCTGGC GCCGGAGATT TGGGCGCTGG CGATCTCAAT
GATCCGGCGG CAGAGCACAA GCGCTCCGAT TTCTTTACGC TGCGCGACAT CGATCTCGAC
GCGCCCGGCG CGCTGACAGA CATGGCGCTC TGCTACAAAT ACTGGATTGC GCTCACCGAC
TGTGATGGGT TTCGCCTCGA CACGCTCAAA CACGTCTCGT TCGAGCAGGC GCGCAACTTC
TGTGGCACAA TCAAAGAGTT TGCCGCCAAC CTGGGCAAGA CCAACTTCTT CCTGGTTGGC
GAGGTGGCGG GCGGCGATTT TGCTGCGACG CGCTACCTCG ATGCGCTGGA ACGCAACTTG
AACGCGGCAC TCGACATTGG CGAAATGCGG TTGGCGCTGA GCGACGTGGC GAAAGGTCTG
GCGCCAGCAC GCGCCTATTT CGACGGATTT GTTCCCGGGC TTGCCATTAT GGGGTCACAC
CGCAACCTCG GCAGTCGGCA CATCTCGATC CTCGACGATC ACGATCACGT TTTTGGTGCG
AAACTGCGCT TTTCGACCGA TGTTATGGCG CAGCATCAGG CTGCGGTTGC GGCGGCGTTG
CAATTGTTCA CGCTCGGCAT TCCGTGCATC TACTACGGCA CAGAACAGGC GCTCGGTGGT
CCAGAGCTGT CGGAGCGCCG CTGGCTGCCG GAGTGGGGGC GTGCCGATCG TTACCTGCGT
GAGGCGATGT TTGGTCCGCT GCATCCGCGT GCATCGGAAC GCGCTGGACT CGATCCACAG
GCGCGTGATG GGTCGTTGCC GGGGTTTGGT CCGTTCGGCA CTGCCGGGAG TCACTGCTTC
GATGAGCGGT TCCCGGTCTA TGTGCGGATT GCAGCGCTGA GTGCGTTGCG TGCCGCCTAT
CCGGTATTGC GCCATGGGCG CCAGTACCTG CGCCCGATCT CGAATTTCAA CCAGCCATTC
GCTTTTCCGC CAGCCGGAGA AATCATCGCC TGGTCGCGCA TTCTCGACGA CGAAGAAGCG
CTGTGCGTCA TCAACCCACA CGGCGTTGCC GCGCGCGGCG GTGATGTCGT GGTCGACGCG
GCGCTGAACC GTCCTGGCGA TATGCTGACC ATCATCTTAA ACACAGCGCA ATCTGCTGAT
CCGCTGGGGT ACGTCGGACC ACACCCGGTC GGTCAGCGTC TGCCGGTTCG AGAGCGCAAC
GGCGCGTCGT ATGTTGAGAT TCGCAACTTG CCGCCAGCCG AAACGCTGGT GTTGACCAAT
CGACCATAA
 
Protein sequence
MISRVLAVIL IMKDMPMNFI RDTLSRPRPP SVRRSVQLPR RVTYYPSPVD WRDEVIYFLL 
VDRFSDGQEE TRPLLDRRYL AAARPTLPNG EPWRWDRWAV SGGERFQGGT LRGVVSKLGY
LHRLGVTTLW LSPVCKQRGH LNTYHGYAIQ DFLDVDPRFG TRQDLVDLVS AAHEQGMRVL
LDIVFQHSGP NWRYPPDVPG GAEMPRYTTG RYPFGSWLDA TGAPLLGVPD VDDAAWPDEM
RNVGYYTRAG AGDLGAGDLN DPAAEHKRSD FFTLRDIDLD APGALTDMAL CYKYWIALTD
CDGFRLDTLK HVSFEQARNF CGTIKEFAAN LGKTNFFLVG EVAGGDFAAT RYLDALERNL
NAALDIGEMR LALSDVAKGL APARAYFDGF VPGLAIMGSH RNLGSRHISI LDDHDHVFGA
KLRFSTDVMA QHQAAVAAAL QLFTLGIPCI YYGTEQALGG PELSERRWLP EWGRADRYLR
EAMFGPLHPR ASERAGLDPQ ARDGSLPGFG PFGTAGSHCF DERFPVYVRI AALSALRAAY
PVLRHGRQYL RPISNFNQPF AFPPAGEIIA WSRILDDEEA LCVINPHGVA ARGGDVVVDA
ALNRPGDMLT IILNTAQSAD PLGYVGPHPV GQRLPVRERN GASYVEIRNL PPAETLVLTN
RP
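
As an illustration of how the protein sequence relates to the gene sequence, the following minimal sketch translates a DNA string codon by codon. It is not the database's own pipeline. It builds the standard codon table, whose codon-to-amino-acid assignments are the same as those of translation table 11 (the two tables differ only in which codons may act as start codons, and this sketch does not handle alternative starts). The names `CODON_TABLE` and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order ('*' marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna):
    """Translate DNA in frame 1, stopping at the first stop codon."""
    # Remove the spaces and line breaks used to group the sequence in blocks of 10.
    dna = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]  # KeyError on non-ACGT input
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last lines of the gene sequence above.
first_line = "ATGATCTCAA GGGTCCTGGC GGTGATTTTA ATAATGAAGG ACATGCCCAT GAACTTCATC"
last_line = "CGACCATAA"
print(translate(first_line))  # MISRVLAVILIMKDMPMNFI
print(translate(last_line))   # RP
```

The two outputs correspond to the first 20 residues and the final two residues of the protein sequence listed above; the trailing TAA is the stop codon and contributes no residue.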