Gene Rcas_3026 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_3026 
Symbol 
ID5540522 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp3922695 
End bp3923834 
Gene Length1140 bp 
Protein Length379 aa 
Translation table11 
GC content61% 
IMG OID640895146 
Producthypothetical protein 
Protein accessionYP_001433099 
Protein GI156742970 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1980] Archaeal fructose 1,6-bisphosphatase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones42 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAGCGG AATTGACGAT CAGCTGCATC AAGGCTGATG TGGGCGGATT CGTCGGGCAC 
TCGGCAATCC ATCCGGCGTT GAAGGACGAG GCAGGACGTC AACTGGAAAT CGCGCGCCGT
CACGGTCTCT TGATCGATTT TCACGTCACT GCGTGCGGGG ATGACCTGGG GTTGCTCATG
ACCCACCGCC TGGGTGTCGA TAACCACGAT ATTCACCGTC TCGCCTGGGA TGTGTTCGAT
ATCTGCACCC GCGTGGCGAA GGAACTGAAA CTGTACGGCG CCGGACAGGA CCTGTTGAAG
GATGCGTTTT CCGGCAATGT GCGCGGCATG GGTCCGGGCA TCGCCGAGAT GACCATCGTC
GAGCGTCTAT CAGAGCCGAT CATTGTGTTC TTTGCCGATA AAACCAGCGC CGGCGCCTGG
AACCTGCCGC TCTTCCGTAT GTTTGCCGAT CCGTTCAACA CCGCAGGGCT GGTGATTTCA
CCGGCGATGC ACGAAGGGTT TCGCTTCCGT GTGCTCGATG TGCGCAAGGG CGAGACGATC
ACCCTGAGCA CACCGGAGGA GATGTACGAT CTCCTCGTCT TCATTGGTGC ACCAGGGCGC
TACGTGGTAG AAAGCATAAC GGCAAAGAGC ACCGGCGTCA TCGGTGCGGT GTCGTCTACA
CAACGGCTGG CGCTGATTGC CGGGCGCTAC GTCGGGAAGG ACGACCCGGT CTGTATTGTG
CGGGCGCAGG GTGAGTTCCC GGCGGTCGGC GAGGTGCTGG AGCCATTCAC GATGCCGCAC
ATTGTCGAAG GATGGATGCG TGGATCGCAC TACGGTCCGC TGATGCCCTG CCGCATCGGT
GATGCCCATC CGGGGCGCTT CGACGGACCA CCGCGGGTGA TTGCGCTGGG GTTCCAGATC
GCTGAGGGGC GGCTGATTGG ACCGCGTGAT ATGTTCGATG ATCCCAGTTT CGATGAGGCG
CGTCGTCTGT GCAATGTCAT CGCCGACCAT CTGCGCCGTC ATGGTCCCTT CGAGCCGCAC
CGCCTGCCGA TGGAGGAGAT GGAATATACC ACGCTTCCAG AGGTGATGAA GAAACTGGCA
GATCGCTGGC AGCCGGTGAA TGGTCACGCG GTCAGCCCGG AGCAGGTGGC ATACCAATGA
 
Protein sequence
MAAELTISCI KADVGGFVGH SAIHPALKDE AGRQLEIARR HGLLIDFHVT ACGDDLGLLM 
THRLGVDNHD IHRLAWDVFD ICTRVAKELK LYGAGQDLLK DAFSGNVRGM GPGIAEMTIV
ERLSEPIIVF FADKTSAGAW NLPLFRMFAD PFNTAGLVIS PAMHEGFRFR VLDVRKGETI
TLSTPEEMYD LLVFIGAPGR YVVESITAKS TGVIGAVSST QRLALIAGRY VGKDDPVCIV
RAQGEFPAVG EVLEPFTMPH IVEGWMRGSH YGPLMPCRIG DAHPGRFDGP PRVIALGFQI
AEGRLIGPRD MFDDPSFDEA RRLCNVIADH LRRHGPFEPH RLPMEEMEYT TLPEVMKKLA
DRWQPVNGHA VSPEQVAYQ