Gene Rcas_4139 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_4139 
Symbol 
ID5541650 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp5358374 
End bp5360194 
Gene Length1821 bp 
Protein Length606 aa 
Translation table11 
GC content61% 
IMG OID640896250 
ProductABC-2 type transporter 
Protein accessionYP_001434188 
Protein GI156744059 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1682] ABC-type polysaccharide/polyol phosphate export systems, permease component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.795712 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00294376 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGCATGG CACGTCTTGC AGCGCCTGCA ACCTGGATCG TGGCATTGAG TCGCCAACGG 
GGACCATTGC GTCTTTACCA GTGGCTCTGG CTGCTGGTAT GTGCGCTGGG TGCGCTGGCT
GCGGCTGCGC CGCGCATTCT TTCGCAGCCG ATCCGTTATG AAACGGTCGC GACGGTGCAG
ATTGACGCCG TTGGGCGCTA CCATGAGTTG TATACCGACG GTCAACCAGA CGATGATTAC
CGCGCTGTTG AGATGCAGGC GTTTGAGTTG CTCAGGGCGC GCCGCCCTGA CCTCGGCGGT
CCAACGTATG CGGTGCGTTT TGTGCCGTAT TCTGATGGGC GGGTCGAGAT CATTGCCATC
GGGCGCGCGC CGCTCGAAGC GCAAGTGGTG GCGGACGAAG CGGCAGAAAC GTTGGCGCGC
GCGGTACGCG CCGCCGGTGG GCGCGAGATT CTGCGTAATC TGATGGGCTG GGAACTGACC
GAAGCGCTCC AGGGTCGCCA GCCGGAAACA GCATTTCAGC GCCTGTTGCG CGAAATTATT
CGGACGCAGG CGTTTCCCCT CAACCGGGCG GTCGAACCGG TATCGGCGTA TATTACCGTT
GATCAACTGC CGCAGGAAGA GTTGAGCGAC CTGGCTCGCG CCCTGGAAGT GCGCGAGGCG
CAGTTGACCC GTATCGATAT TCCTGCGCTG GAAATACGCC GCACCACGGC GACCGGTGCG
ACGCTCCAAC AGATCGACGC CGACCTGTTG CGGTTGACGG CGGGGCGGCA GGCGATCCGC
GAAGCGCTGG CGTATCTGTA CCGTAATCTG GGAGCAGCCT TCGCTCCCGA CGCGCCGGGT
GATGCCTATC GCGCCAGCCG CGCACCGCTG CCGGAACGTG CCGTTGATCG GCGTATTCCG
CTCCTTCTTT CACTGGCGAC GGTTGTCGGA GTGCTGTTCG GCGCGGCTGG CGTGGCGATT
GATCGGAGCG CCGGCGCGAT GCGCAAAGTC CTCGAACTTT GGGCGTATCG TGAACTGATC
CGCAACCTGG TGCTGCGCGA TTTGCAGGTG CGTTATAAAG GAAGCGCGCT CGGGTATCTC
TGGACGCAAC TCGCACCCCT CCTGCTGATG CTGGTCTTCT GGTTTGTCTT CAGCGCCTTC
TTTCAGGCGG ATATCGCCAT GTTCCCAGTG TTCCTTCTGG TCGGCTTGCT GCCCTGGAAC
TTCGCCAGCG AGGCGGTGAA CGGTGGGGCG CGCAGTGTGA TCGACAACGC CGCGCTGGTC
AAAAAAGTGT TCTTCCCGCG CGAGGTGCTG CCGCTGGTCG CGGTGCTCTC CAGCCTGGTG
AATTTTGTGC TGTCGCTGCC GATGCTGCTG CTGGTGATGG CGGCGGTTCA GTGGATGTAT
GCGCCGCTGC GCGCTATCGG CGCCTGGACG AACTTCTCGT GGACATTTGT GTATCTGCCG
GTGTTGATCG GCATTCAAAC TATTTTTCTG GCAGGAGTGG CATTGTTCCT CAGCGCACTT
GCGGTGCGCT ACCGCGATAC GGTCCACCTG ATCGGCATTT TCATTCAGTT CTGGTTTTTT
CTTACACCGG TCATCTATGC ACTCGACCGG GTCGCCGGTC CGTTGGCGCA GCTGGTGCGC
TGGCTGAACC CGATGGCGTC GCTGATCGAG TTCTACCGCG AAATCCTGTA TGGCAACGCG
GTTGCGTTCG GTCAGATTCC TACGCCAAAC CTGCCTGCGC TCGACAGTGT GCTGCGCGTT
CTGCTGACCG CCTTCGCAAC GCTGGCGATC GGCTATTGGT ACTTCCAGCG CCGTAGCGGT
GAGTTCGGAG AACGGTTGTA G
 
Protein sequence
MSMARLAAPA TWIVALSRQR GPLRLYQWLW LLVCALGALA AAAPRILSQP IRYETVATVQ 
IDAVGRYHEL YTDGQPDDDY RAVEMQAFEL LRARRPDLGG PTYAVRFVPY SDGRVEIIAI
GRAPLEAQVV ADEAAETLAR AVRAAGGREI LRNLMGWELT EALQGRQPET AFQRLLREII
RTQAFPLNRA VEPVSAYITV DQLPQEELSD LARALEVREA QLTRIDIPAL EIRRTTATGA
TLQQIDADLL RLTAGRQAIR EALAYLYRNL GAAFAPDAPG DAYRASRAPL PERAVDRRIP
LLLSLATVVG VLFGAAGVAI DRSAGAMRKV LELWAYRELI RNLVLRDLQV RYKGSALGYL
WTQLAPLLLM LVFWFVFSAF FQADIAMFPV FLLVGLLPWN FASEAVNGGA RSVIDNAALV
KKVFFPREVL PLVAVLSSLV NFVLSLPMLL LVMAAVQWMY APLRAIGAWT NFSWTFVYLP
VLIGIQTIFL AGVALFLSAL AVRYRDTVHL IGIFIQFWFF LTPVIYALDR VAGPLAQLVR
WLNPMASLIE FYREILYGNA VAFGQIPTPN LPALDSVLRV LLTAFATLAI GYWYFQRRSG
EFGERL