Gene Rcas_3643 details


Gene Information

Locus tag: Rcas_3643
Symbol:
ID: 5541145
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 4765645
End bp: 4766976
Gene Length: 1332 bp
Protein Length: 443 aa
Translation table: 11
GC content: 50%
IMG OID: 640895763
Product: ABC transporter related
Protein accession: YP_001433710
Protein GI: 156743581
COG category: [G] Carbohydrate transport and metabolism; [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1134] ABC-type polysaccharide/polyol phosphate transport system, ATPase component
TIGRFAM ID:
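
The start and end coordinates, gene length, and protein length listed above are mutually consistent. A minimal sketch of the arithmetic follows, assuming the coordinates are 1-based and inclusive and that the final codon is an untranslated stop codon; the variable names are illustrative and not part of the record.

```python
# Coordinates as listed in the record (assumed 1-based and inclusive).
start_bp = 4765645
end_bp = 4766976

# Inclusive span: both end positions count toward the length.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1332 443
```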


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000162695
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0000834131
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage

Sequence

Gene sequence
ATGGCAGTTC CAACCGCAGA CATTCTCCCT CAGGAAGCGG CGCCACTCCC CGCACCGACC 
GACGAGGTGG TCATCAGCGT CCAGAACGTC GGCAAAATGT ACCGCATCTA CGACCAACCG
CAGGATCGGC TGAAGCACAT GCTCTTCTGG CGGTTCGGCA AGCATTACGG GCGCGAGTTT
TGGGCGCTGC GCAACGTCAG TTTCGAGGTG CGGCGCGGCG AAACGGTGGG GATCATCGGG
CGCAACGGGA GCGGCAAGAG CACGCTGCTC CAGATCATTG CCGGCACCCT GGCGCCGACC
GAAGGCGAAG TGCGGGTGAA GGGGCGCGTG GCGGCGCTGC TTGAACTCGG CAGCGGCTTT
AATCCCGAAT TCACCGGGCG CGAAAATGTC TACCTCAACG GCGCGATCCT GGGGCTGAGC
CGCGACGAAA TCGACGCGCG CTTCGACGAC ATTGCCGCAT TCGCAGACAT CGGCGAGTTC
ATTGATCAGC CGGTGAAGGT GTATTCGAGC GGCATGTATG CGCGGTTGGC GTTTGCGGTT
GCTGTCTCGC TCGATCCTGA CATACTGATT GTTGATGAAA TTCTTGCTGT GGGTGATCTT
GCGTTTCAGC AGCGTTGTGC GACAAGATTG CGGCAACTTC GAGATGAAGG GTTGACGCTT
TTGTATGTCA GTCACTCGCC TGACTCTATC AAGAGCGTTT GTCAAAATGC TTTGCTACTA
GTTGAAGGAA AAGTTTTTTA CAAAGGGCAT GCGGAAGAGA CGATGAATAA GTATCTCAAC
CTGATTCGTG AAGAGACAAA TGCAGCTATG TTGTCGAAAG AGGCACATCT TCCTCGATAT
ATACCATTTC ATTCTCTGGC AAAAGCGAAG CTTCGCTATG GGTCAGGTCA CGTTCAAATC
GAAAGAGTGG AATTGTTGGA CGAGGACGGA CAATTCTCTC ATTCTTTTAG ATTTGGTGAT
ATGATTACGA TTCTAGTAGA GTTCCGTTCG TATATCGAGG TGAACCATTT TAGTTGTTCG
TTTCTAGTAA GGGATTCTAC AGGCGTTGAT CTTTTTGGGA CAACTACTTT TGATGAGAGA
ATAGAGTTTC CAATACTCAG AGTTGGAAAC AGAGGGCTTG TTCGATTCAA ATTTCTCAAT
CAACTGAGGA TTGGTAACTA TGGTGTGTCA GTGGCTCTGA ATCGTACTTC GCAAAGAGAT
TATTCTGATA ATATTCTTTT CGATCAGATA GACGGTGCTG CTGTGTTTTC CGTGGTGCCT
GATATTTCCA GGCCGGTGCA TTACAAATTC TTTGTTCCGA TTTCGATAGA TTATGAGGTA
ACGTTTGAAT GA
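
The gene sequence is printed in space-separated blocks of ten bases, so it needs cleaning before any computation. The sketch below shows one way to strip the whitespace and compute a GC fraction, which could be applied to the full sequence to check the reported GC content of 50%. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first printed line is used as sample input here.

```python
def clean_sequence(text):
    """Drop spaces and line breaks from a sequence printed in blocks of ten."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Return the fraction of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First printed line of the gene sequence, copied from the record.
first_line = "ATGGCAGTTC CAACCGCAGA CATTCTCCCT CAGGAAGCGG CGCCACTCCC CGCACCGACC"
cleaned = clean_sequence(first_line)
print(len(cleaned), round(100 * gc_fraction(cleaned), 1))
```

Passing the whole sequence block (all lines joined) to `clean_sequence` works the same way, since `str.split()` with no arguments splits on any run of whitespace, including newlines.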
 
Protein sequence
MAVPTADILP QEAAPLPAPT DEVVISVQNV GKMYRIYDQP QDRLKHMLFW RFGKHYGREF 
WALRNVSFEV RRGETVGIIG RNGSGKSTLL QIIAGTLAPT EGEVRVKGRV AALLELGSGF
NPEFTGRENV YLNGAILGLS RDEIDARFDD IAAFADIGEF IDQPVKVYSS GMYARLAFAV
AVSLDPDILI VDEILAVGDL AFQQRCATRL RQLRDEGLTL LYVSHSPDSI KSVCQNALLL
VEGKVFYKGH AEETMNKYLN LIREETNAAM LSKEAHLPRY IPFHSLAKAK LRYGSGHVQI
ERVELLDEDG QFSHSFRFGD MITILVEFRS YIEVNHFSCS FLVRDSTGVD LFGTTTFDER
IEFPILRVGN RGLVRFKFLN QLRIGNYGVS VALNRTSQRD YSDNILFDQI DGAAVFSVVP
DISRPVHYKF FVPISIDYEV TFE
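
The protein sequence can be related back to the gene sequence by translating codon by codon. The following is a minimal sketch, not the database's own method. It assumes that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in which start codons are permitted, which does not matter here because the gene begins with ATG). The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (third base varies fastest).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"   # codons starting with T
    "LLLLPPPPHHQQRRRR"   # codons starting with C
    "IIIMTTTTNNKKSSRR"   # codons starting with A
    "VVVVAAAADDEEGGGG"   # codons starting with G
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 12 bases of the gene sequence, copied from the record.
print(translate("ATGGCAGTTC CAACCGCAGA CATTCTCCCT"))  # prints: MAVPTADILP
print(translate("ACGTTTGAAT GA"))                     # prints: TFE
```

Both results match the first ten and last three residues of the protein sequence above; the final TGA codon is the stop codon and produces no residue.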