Gene Rcas_4418 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_4418 
Symbol 
ID5541931 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp5673061 
End bp5675346 
Gene Length2286 bp 
Protein Length761 aa 
Translation table11 
GC content60% 
IMG OID640896516 
ProductTRAP transporter, 4TM/12TM fusion protein 
Protein accessionYP_001434452 
Protein GI156744323 
COG category[R] General function prediction only 
COG ID[COG4666] TRAP-type uncharacterized transport system, fused permease components 
TIGRFAM ID[TIGR02123] TRAP transporter, 4TM/12TM fusion protein 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.868477 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.0895088 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCATCG CCGCCAACGC TCAAAACAAC GACGCGCCTG ACGAAGAGGT CATTTCCAGG 
GAAAAAGTCG AGCAACTCAT CGAGGAGTTC GAGAACGAAG CAGCAACCCG AAAACTGAGC
GGCGCCTGGG CATGGATCGC CGGGATTGTT GCGGCGGCGC TCTCGATCTA TGCACTCTAC
TGGACGCAGG CGATCATCAC GACTCAGGTT TACCGTGCCA CGTTCCTGAT GCTCGTCCTG
GCGTTGACGT TCTTCTACTT CCCATTGCGC AAGGCTGCGC GCACGAAAGT GCCCTGGTAT
GATGTTGTGC TGGCAGCGCT TGGCGCTGCC AGTATGATTT ATTTGAGCCT CAATTTCCGT
GATGCGTTGC AGCGGGTCAC TCAACCCACG CCGACCGAAC TCGTGATGGG CGCCATTATG
CTGCTGCTGG TGTTGGAAGC GACCCGCCGC ACCACAGGTA TGGCGCTGAC ACTGGTAGCA
GTGTTCTCGA TACTGTACGC GCTCTTCGGG TATGTTTTTC CTGAACCGTT CGACCATCGC
GGAATCTCGC TGCAACGTCT GATCGGCACA AACTACCTGA CATTGCAGGG TGTGTTTGGC
GTGCCGCTCG ATGTGGCGGC CACATTCATC GTGCTGTTTA CGATCTATGG CGCGGTGCTG
GAGTACAGCG GCGCGGGCAA GTTCTTTATC GACTGGTCCT TTGCGGCGCT CGGCAAGTCG
AAGAGCGGCG CCGGCCCGGG ACGCACCGTG GCGGCGGCCG GGTTTTTGCT CGGCACCGTG
TCGGGCAGTG GCGTGGCAAC GACGGTCACA CTGGGATCGC TGTCGTGGCC TATGCTGCGC
AAGGCAGGGT ACGATAAGAC GGTCGCGGCC GGAATGCTGG CTGCATCCGG GATCGGCGCC
ACCCTCTCGC CGCCAACCCT TGGAGCAGCA GCGTTTCTGA TCGCCGAATA CCTGGATATT
TCCTATCTCG ATGTGCTGAT CATGGCGATT GTGCCGACGA TCCTCTACTA CCTGTCGATC
ATTCTGATGA TCGAAGCCGA CTCGCGCCGC ATGAAGACAC AGGCGGTCAC ATTCGAGAGC
GAATCGCTCT GGGAGTTGAC GCGCAAGTTT GGGTATCACT TCTCGTCGCT GTTTGCCGTC
GCCATTCTGA TGGGGTTTGG CATGACGCCC TTTATGGCGG TCTACTGGTC GATTGTCGTT
GCGTTCTTCC TGAGTTTTCT CCGTCCCGAA ACGCGCCTCT CCTCCCTGAA AGCGCTGGCG
TCCGGCGTCG CGCTGATGGC GCTGCTGATG GCGCTGGAAG TGACCGGTGT GCTGCCGCGT
ATGCGTCCAT CGGTGGCGAT CTTCTGGGGG TTAATGCTTA CCGTCGTGAT CGCGGCGGCG
ATGGCGCTCT ACCGGCGGGT GCGCGCCATT CCGGGAGAAG ATGAGAACAT GCGCATTCTC
AGGGCGCTGG AATATGGCGG GCGCAGTGTG GTGTCGATTG CCGCCACCAC CGCCTGCGCA
GGCATCATCG TCTCAGCCGT GACGTTGACC GGACTGGGGC TTAAGATTTC GGGCATGATC
GTGAGCCTTG GCGGCGGCAA CATCCTGATG ACGGTCTTCT TTGCAGCCAT TGCTGTCTGG
GTATTGGGAT TGGCGGTGCC GGTCACGGCA TCGTACATTC TGGCAGCGGT CATGATCGTT
CCCGCGTTGC GACAGGTGGG TGTGCCGGAA CCCGCTGCGC ATATGTTCAT TTTCTATTAT
GCGGTCCTGG CTGATGTGTC GCCGCCGACT GCGCTGGCGC CTATGGCTGC CGCAGCGATT
ACCGGCGGGC GTCCGTGGCC CACAATGTTT ATGGCATGGA AGTATTGTCT GCCCGGTTTC
CTGGTGCCGT TCATGTTTAC GATGACGACC GACGGGACGA GTCTGCTGCT GTTGTTGCAG
CAGGTTGGCA AAGATGTGGG CACGGTGACG CTGGCGTTCA AACCTGCCTG GTATGAGGCG
CTGGCGGCAG GAGGGTGGAT GACGATTGTG GTGACCTTCC TGACCAGTTG CCTGGCAGTT
GGCGCCCTGG CGGTCGCTTT TGGCGGATGG CTGCTCCGTC AGGCGAATCT TTTCGAGCGC
GTGCTGATGG GTGTTGCCGG TTTGGCGATG CTCTATGCCG ACCTGGGAGC GGACGCCGTT
GGGTTTGGGT TGTTTATTGT GGGCGTGCTG GTTCACGTGG TGCGGGTGCG CCAGATGCGC
AAGGCGGAGT CGGTCGCTGT CGCCGCAGTG GAAACGGACA CGTTCAGCGA GCGCTCGGTT
GGATGA
 
Protein sequence
MSIAANAQNN DAPDEEVISR EKVEQLIEEF ENEAATRKLS GAWAWIAGIV AAALSIYALY 
WTQAIITTQV YRATFLMLVL ALTFFYFPLR KAARTKVPWY DVVLAALGAA SMIYLSLNFR
DALQRVTQPT PTELVMGAIM LLLVLEATRR TTGMALTLVA VFSILYALFG YVFPEPFDHR
GISLQRLIGT NYLTLQGVFG VPLDVAATFI VLFTIYGAVL EYSGAGKFFI DWSFAALGKS
KSGAGPGRTV AAAGFLLGTV SGSGVATTVT LGSLSWPMLR KAGYDKTVAA GMLAASGIGA
TLSPPTLGAA AFLIAEYLDI SYLDVLIMAI VPTILYYLSI ILMIEADSRR MKTQAVTFES
ESLWELTRKF GYHFSSLFAV AILMGFGMTP FMAVYWSIVV AFFLSFLRPE TRLSSLKALA
SGVALMALLM ALEVTGVLPR MRPSVAIFWG LMLTVVIAAA MALYRRVRAI PGEDENMRIL
RALEYGGRSV VSIAATTACA GIIVSAVTLT GLGLKISGMI VSLGGGNILM TVFFAAIAVW
VLGLAVPVTA SYILAAVMIV PALRQVGVPE PAAHMFIFYY AVLADVSPPT ALAPMAAAAI
TGGRPWPTMF MAWKYCLPGF LVPFMFTMTT DGTSLLLLLQ QVGKDVGTVT LAFKPAWYEA
LAAGGWMTIV VTFLTSCLAV GALAVAFGGW LLRQANLFER VLMGVAGLAM LYADLGADAV
GFGLFIVGVL VHVVRVRQMR KAESVAVAAV ETDTFSERSV G