Gene Rcas_3601 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_3601 
Symbol 
ID5541102 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp4699923 
End bp4701266 
Gene Length1344 bp 
Protein Length447 aa 
Translation table11 
GC content62% 
IMG OID640895720 
Producthypothetical protein 
Protein accessionYP_001433668 
Protein GI156743539 
COG category[R] General function prediction only 
COG ID[COG3889] Predicted solute binding protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.413111 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTCAGGG CGCCTGAAAC GCAACGACAA CTCGAAGAGT TGCGCCGGGT GTTGCTCAAG 
CCGGAAGCGC TGGTTGATCG CATCAGCCCG GTTATTGCCG ATATTCTGGC GGAACAGATC
AATCAGTCGC GTGATGAAAT TGCACACGCC ATAGCGCCTG CTATCGGTGA AGCCATTCGC
CACCAGGTCT ATCAGGCGCG TGAAGATATT GTCGATGCGC TCTACCCCGT GGTTGGGCAG
ATGATCACCC GCGCCGTCGC CGAAGCAGTG CGCAATCTGG CGCAATCGAT CGATGAGCGC
GTTCGTCAAA GCACCTCGGT GATGCTCAGC CCTCGCTACT GGCAGGCGCG CGTGCAGGGG
GTCTCACATG GCGAGTACGC CTTGCGCGAA GTCCTTCCCT TTACTATTCA TGAACTCTTC
CTGATCCAAC GCGAGTCGGG GGTGCTGATC TGCCACTATT CCGCCGGACC AGAACGCCCG
GACCGCGATG TCGTCAGTGG GATGCTCACT GCTATTCGTG ACTTCGCGCA GGAAGCGTTT
GGGCGCGAAG AAAGCGGAGA ACTCGGCGCC ATTACCTATG AGTCGCGCCA GATCATCCTC
GAGACCGGAA GCGCCGCATA CCTGGCAGTG GTCATCAGCG GTGTTGAGCC GCCAGACTTC
CGCGAACGCC TGCGCGAAAC GCTCTTTGCC ATTCACGAAC ATCGCTACGA GCGTCTCCGC
GCCTTCGACG GAACCGATGC CCGACTGATC CAGGAAGCGC GTCAGACACT GCGGCAACAT
CTGGTTCCGC AGCAGGAAGA CCACCCTCCG CGACGTCTCT CGATGCTTCA GCGCGTAATT
GTGGTTGTGA TCGGATTGTT TGTGCTGTCG CCGTTGCTCC TCTGTGGCGC CTGGATCTGG
CATGTCGAAA CGCGAATGGC GATGCTCATG ACGCCGCCGA TTGCAGCGCC AACGGCGACC
GCCACGCCAA CGGCGACCGC CACGCCGACG CCTACCAGCA CGCCGACGGC AACCGCCACG
CCGACGCCTA CCAGCACGCC GACGGCAACC GCCACGCCGA CGGCAACCGC CACGCCGACG
GCGACGCCTT CGCCATTCAA TGGTGTCATG ATCGGAAACG TGTATCTGTA CAGCACGCCG
GACGAAGCCA GTACACGCAC CGGTATCGTT GCACCGCTCG GCGCGCCGGT CGAAGTGCTG
GCACAGCGAG GTGATTGGTA CCGAGTGCGG GTAGCGCTGC CGCAAAACCC GCAGGTCGAA
CTGATCGGAT GGATCCCGGC GCGTTGGGTC AGCCTGCTCA AACCGGTGCC GCCCGAAGTA
ATTACGCCGA CTGCAACACA GTAG
 
Protein sequence
MVRAPETQRQ LEELRRVLLK PEALVDRISP VIADILAEQI NQSRDEIAHA IAPAIGEAIR 
HQVYQAREDI VDALYPVVGQ MITRAVAEAV RNLAQSIDER VRQSTSVMLS PRYWQARVQG
VSHGEYALRE VLPFTIHELF LIQRESGVLI CHYSAGPERP DRDVVSGMLT AIRDFAQEAF
GREESGELGA ITYESRQIIL ETGSAAYLAV VISGVEPPDF RERLRETLFA IHEHRYERLR
AFDGTDARLI QEARQTLRQH LVPQQEDHPP RRLSMLQRVI VVVIGLFVLS PLLLCGAWIW
HVETRMAMLM TPPIAAPTAT ATPTATATPT PTSTPTATAT PTPTSTPTAT ATPTATATPT
ATPSPFNGVM IGNVYLYSTP DEASTRTGIV APLGAPVEVL AQRGDWYRVR VALPQNPQVE
LIGWIPARWV SLLKPVPPEV ITPTATQ