Gene Rcas_4417 details


Gene Information

Locus tag: Rcas_4417
Symbol:
ID: 5541930
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 5671757
End bp: 5672803
Gene Length: 1047 bp
Protein Length: 348 aa
Translation table: 11
GC content: 60%
IMG OID: 640896515
Product: TRAP transporter solute receptor TAXI family protein
Protein accession: YP_001434451
Protein GI: 156744322
COG category: [R] General function prediction only
COG ID: [COG2358] TRAP-type uncharacterized transport system, periplasmic component
TIGRFAM ID: [TIGR02122] TRAP transporter solute receptor, TAXI family
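
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only and assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that contributes no residue; the variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 5671757
end_bp = 5672803
gene_length_bp = 1047
protein_length_aa = 348

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
span = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and encodes no residue.
codons = span // 3
residues = codons - 1

print(span, span % 3, residues)  # prints: 1047 0 348
```

Under these assumptions the span equals the listed gene length, is a whole number of codons, and yields the listed protein length.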


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.0674202
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTCAAAC GAACGCTGGT CGTCTTCTGG TCAATTATGG CGCTAGTACT GGCTGCGTGT 
GGACAGCCGA CCCCCGCAGC GCCAACAACC GCCCCTTCAG CCCCCACCAC CGCGCCTTCC
GGCGAACAAC CCACAACAGC GCCCGCCGCG CCCAGCGGGG AAAAACTGCG CTTGATCATC
GGCACCGGCG GCACCGGCGG CGTGTTCTTC CCCTATGGCG GCGGTCTGGC GCGTATTCTT
ACCGAAAAGA TGCCCAACAC GGAAGCGACG GCGCAGGAAA CCGGCGGCTC CGTCGATAAT
ATGAAACTGC TCCAGAATTC GGAAGCGCAA ATCGGCTTCA CCACCGCAGA TTCGGCATAC
GATGGCATTA ATGGCGTGGC AGCATACCAG GCGACCGGTC CGGTTCCCGC CGCAACCATT
GCCGTGCTCT ATCAGAGTTT CATCCATGTG GTCGCGCGCG CCGATTCGGG TATCACAAGC
GTCGCCGATA TGAAAGGGCG ACCTGTCTCG ATTGGATCGG CGGGCAGCAG CACGGAAACC
GCGGCTATCC GCCTGCTCGA AGCCGCAGGG CTGAAGCCGG AAGATGTGGT CCGCGAAAAC
CTGGGGGTGG CCGACTCGGT GGCTGCCATG AAGGATAAGA AGATCGATGC CTTCTTCTGG
ATCGGCGGTC TGCCGACGGC TGCGGTGACC GACCTGGTTA CGACCGAGTC GGTGGTCTTT
ATCGACACCA GTTCGCTGCT GAAGCCGATG GTGGATAAGT ATGGTCCAAT CTACGCCGCT
ACCGTGCTGC CTGGCGGCAC CTACAAAGGG ACGGACCAGG ATGTGCCGGG GATTGGCGTT
GCCAACCTGC TCGTGGTGCG CGAGGATATG CCCGCTGACC AGGTAAAAGG CATTCTGACC
ACAATTTTCG ACAATCTCGA AGAAGTGCAT CAGATCCATC CCGCAGCGCG CACCCTGTCG
CTTGAATCGG CAGTCACCGG TTCATCGATA CCGTTCCATC CGGCTGCAAT CGAGTTCTAC
AAGGAACGTG GCGTTTGGAA GGAGTGA
 
Protein sequence
MVKRTLVVFW SIMALVLAAC GQPTPAAPTT APSAPTTAPS GEQPTTAPAA PSGEKLRLII 
GTGGTGGVFF PYGGGLARIL TEKMPNTEAT AQETGGSVDN MKLLQNSEAQ IGFTTADSAY
DGINGVAAYQ ATGPVPAATI AVLYQSFIHV VARADSGITS VADMKGRPVS IGSAGSSTET
AAIRLLEAAG LKPEDVVREN LGVADSVAAM KDKKIDAFFW IGGLPTAAVT DLVTTESVVF
IDTSSLLKPM VDKYGPIYAA TVLPGGTYKG TDQDVPGIGV ANLLVVREDM PADQVKGILT
TIFDNLEEVH QIHPAARTLS LESAVTGSSI PFHPAAIEFY KERGVWKE
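
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The following is a minimal sketch of that translation, not the database's own pipeline. It builds the codon table by hand; translation table 11 assigns the same amino acids to codons as the standard code and differs only in which start codons are permitted, so the standard assignments are used here. The names `translate`, `first_30` and `last_27` are hypothetical, and only the first and last rows of the gene sequence are used to keep the example short.

```python
from itertools import product

# Codon -> amino acid map in the conventional T, C, A, G ordering.
# '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string; whitespace between blocks is ignored."""
    dna = "".join(dna.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# First 30 and last 27 nucleotides of the gene sequence in this record.
# Both fragments are in frame: the last one starts at offset 1020.
first_30 = "ATGGTCAAAC GAACGCTGGT CGTCTTCTGG"
last_27 = "AAGGAACGTG GCGTTTGGAA GGAGTGA"

print(translate(first_30))  # MVKRTLVVFW
print(translate(last_27))   # KERGVWKE*
```

The two fragments reproduce the start and end of the listed protein sequence, with the trailing `*` standing for the TGA stop codon.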