Gene Rcas_3538 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_3538 
Symbol 
ID5541037 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp4612842 
End bp4614062 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content59% 
IMG OID640895655 
Producthypothetical protein 
Protein accessionYP_001433605 
Protein GI156743476 
COG category[R] General function prediction only 
COG ID[COG0628] Predicted permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.25155 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTATCTG GACGACTTGG AACCGTAGCA CTGTGGCTGC TGATCGTCTG CGCAGCGGTT 
TTTCTGTTTG AACGCGCCGT GGTTGTGGTC AGTTTTTTCG CCACACCGCT CCTTCTCTTC
GCCCTTGCCT GGCTGATCGC CGTTGTGCTA CAACCGCTGG TGTCGCACTT GACGGCGCTC
GATCTGCCGA CGATTACCAT TCGCGCGCAC AGCGTCCCGG TTCCGCCGCG CCATCTGTCG
CGCGTGCTCT CAGTGGCGCT GATCTACCTG GCGCTCTTCG CTATTCTCCT GGTCGTCATT
CTGTCGTTCG TGCCGACAAT TACGCAACAA CTGACGACGT TGACCGGATC GGCGCCGACC
ACGGTCGAAT CGGTTGTCCG GTGGATCGGT CGGCTGGAAG AAGGGCTGCA ACGGTTCGGC
TTTCGCGGCG ACCTGACAGC CATCGTTCAA CCCGAAGCCA TTACCCGGCA ACTTACCGGT
ATCGGCAGTG CGATGTTGCA GCAATCGCTT GGCATTGCCG GCAGCATCGC CACGGTGCTG
TTCAATATTT TCCTGGTGCT GATCCTCAGT TTCTATATTA CGCTCGACGG TCCGCGCATT
GGCAAGAGTT TCATTATGCT CCTTCCCCGA TCCTGGCACG ATGAGATGGA CGGTCTGTTT
GCCGTTGTTG ATCGCGTGTT TGGCGGCTTT ATGCGCGCGC AGTTTGTCAA CTCGCTGCTC
TATGGCATCG CCAACGCGAT TGTAATGGCG CTGTTCGGAT TGAGCGACAT TGCCCTTGCC
AGCGTGATTG CCGCGATCCT GGTATTCATT CCGCTCGTAG GCGGATTTTT TGCGCTGATT
CCTCCGGCGT TGTTCGCCAT TCTGTTTGTG CCGGATCGGG TAGGGTGGCT GGTCCTGGTG
TTGCTGGCGG TGCAGCAGGT GCAGTTCAAT GTGATCATGC CGCGCCTCGT CGGGCAGGCC
ATCGGACTGC ATCCGCTACT CGTCTTTGCC GCACTGCTCC TCGGCGGAAC CGTTGCCGGC
GGATGGGGAG TCCTCTTTGG CATCCCGGTC GCTGGTGTCA TTGCGTCGAT TGCCCAGTTC
TTCTATGAGC GCGCCCGCCG CACCATGATC ATCGTTCCTT CCACAGTCGA TGAATCGTTG
CCGTCAGCCT CTGCCACGGT TGCGGCGTCT TCCGTCGATC CTGCGCCGGG CAGCCCGCAA
TCGTCGCGCT TGACGCAGTA G
 
Protein sequence
MLSGRLGTVA LWLLIVCAAV FLFERAVVVV SFFATPLLLF ALAWLIAVVL QPLVSHLTAL 
DLPTITIRAH SVPVPPRHLS RVLSVALIYL ALFAILLVVI LSFVPTITQQ LTTLTGSAPT
TVESVVRWIG RLEEGLQRFG FRGDLTAIVQ PEAITRQLTG IGSAMLQQSL GIAGSIATVL
FNIFLVLILS FYITLDGPRI GKSFIMLLPR SWHDEMDGLF AVVDRVFGGF MRAQFVNSLL
YGIANAIVMA LFGLSDIALA SVIAAILVFI PLVGGFFALI PPALFAILFV PDRVGWLVLV
LLAVQQVQFN VIMPRLVGQA IGLHPLLVFA ALLLGGTVAG GWGVLFGIPV AGVIASIAQF
FYERARRTMI IVPSTVDESL PSASATVAAS SVDPAPGSPQ SSRLTQ