Gene Rcas_3620 details

Gene Information

Locus tag: Rcas_3620
Symbol:
ID: 5541122
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 4729684
End bp: 4730739
Gene Length: 1056 bp
Protein Length: 351 aa
Translation table: 11
GC content: 60%
IMG OID: 640895740
Product: permease
Protein accession: YP_001433687
Protein GI: 156743558
COG category: [R] General function prediction only
COG ID: [COG0701] Predicted permeases
TIGRFAM ID:
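
The coordinate and length fields above are mutually consistent, which can be checked with a small sketch. It assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The terminal stop codon (assumed present) does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


# Values taken from the record for Rcas_3620.
length = gene_length_bp(4729684, 4730739)
print(length, protein_length_aa(length))
```

With the values from this record, the computed lengths agree with the listed Gene Length (1056 bp) and Protein Length (351 aa).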


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAGCAGA CATTATCGGC GTTATCACTA CGACATTCAT CACGGCGCGC ATGGGGTATT 
GTTGCCGTCG CTGCTGTTGG CTGGCTGATT GCGTACAACC TGATCCAACC GCTGGCGCAC
TGGTTGGCAT ATCGGGTGCT GGGTTTTGCC GAGGGATCAC AGGCAGGCGA GGCGCTCGCC
TTTTTTCTGT ACGACGTGCC AAAGATCCTG CTGCTGTTAA GTGGGATGAT CTTCGGCATC
AGCGTCCTCC GTTCGTTCTT CAGTCCCGAA CGCACCCGCG CCCTGCTCGG CGGCAAGCGT
GAGGGGATCG GGAACGCACT GGCAGCCGGG TTGGGCGTGC TCACACCATT TTGCTCGTGC
TCGGCGGTGC CGCTCTTTAT CGGCTTTGTC GAAGCGGGCA TTCCGCTCGG TGTGACCTTC
TCGTTCCTGA TTGCCGCGCC AATGGTGAAT GAGGTTGCGC TGGCAATGCT CCTTGGCATG
TTTGGCTGGC AGGTAGCCCT ACTCTATCTG GTTGCCGGCA TGAGTGTGGC AATTCTATCC
GGTATCGTTA TCGGGCGCCT GCGCCTGGAG CGCTACGTCG AGGATTTTGT CTGGCAGATC
AAAGGCGGTG GCGGCGCCGT CGCAGGTGAA GCGCCGACAT GGGCTGAACG GTTTGCCTTT
GCATGGGAGA ATACCCGTGA GATTGTCGGC AAGGTCTGGC TCTACGTCGT CATCGGGATT
GCCATTGGCG CCGGCATTCA TGGCTACGTT CCCGAAGAAG CGCTGTCTGG CATCCTTGGG
CGCGAGGCAT GGTGGTCGGT GCCGGCAGGC GTGATTCTCG GCGTGCCGCT CTATTCGAAT
GCCGCCGGTG TAATCCCGGT GGTACAGGCG CTGATGGCGA AAGGCGCCGC TCTGGGAACG
GCGCTTGCCT TTATGATGGC GGTCGTCGCC CTCAGTTTGC CCGAAATGAT CATTCTGCGC
CGGGTACTCA AGCCACAGTT GATCGCCGTG TTTATCGGCG TGGTGGCCGT CGGGATCATG
ATGGTTGGCT ATCTGTTCAA TCTGGTGATG GGCTGA
 
Protein sequence
MEQTLSALSL RHSSRRAWGI VAVAAVGWLI AYNLIQPLAH WLAYRVLGFA EGSQAGEALA 
FFLYDVPKIL LLLSGMIFGI SVLRSFFSPE RTRALLGGKR EGIGNALAAG LGVLTPFCSC
SAVPLFIGFV EAGIPLGVTF SFLIAAPMVN EVALAMLLGM FGWQVALLYL VAGMSVAILS
GIVIGRLRLE RYVEDFVWQI KGGGGAVAGE APTWAERFAF AWENTREIVG KVWLYVVIGI
AIGAGIHGYV PEEALSGILG REAWWSVPAG VILGVPLYSN AAGVIPVVQA LMAKGAALGT
ALAFMMAVVA LSLPEMIILR RVLKPQLIAV FIGVVAVGIM MVGYLFNLVM G