Gene Rcas_1234 details


Gene Information

Locus tag: Rcas_1234
Symbol:
ID: 5538703
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 1594635
End bp: 1595879
Gene Length: 1245 bp
Protein Length: 414 aa
Translation table: 11
GC content: 62%
IMG OID: 640893369
Product: ABC-type Fe3+ transport system periplasmic component-like protein
Protein accession: YP_001431349
Protein GI: 156741220
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1840] ABC-type Fe3+ transport system, periplasmic component
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
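
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes 1-based, inclusive coordinates and a coding sequence that ends in a single stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 1594635
end_bp = 1595879

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, "bp")      # 1245 bp
print(protein_length_aa, "aa")   # 414 aa
```

Both results agree with the listed Gene Length (1245 bp) and Protein Length (414 aa).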


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
GTGAGTGCCG AAACCAGAGA CTGGCGCCGA TCGCGCCGGG CTTTTCTGCG TGCAGCGGTT 
GGCGCCGGTG CAGGCGCCAT TCTTGCCGCC TGTGGCCAGG CTGCTCCAAC AGCCCCGGCA
GCAACGACCG CTCCGGCTGC GACCACGGCG CCCGCAGCCG AAGCAACCGC ACCTGCCCAA
CTCCCATCGC CGCCTGCACC TGGTCCATTG ACCGCTGCGG ATGCCGGCGG CATAGAGAAG
TTGATCGAAC GGGCGCGCGC GGAAGGCAAC CTGTCCACCA TTGCGCTGCC GGACGATTGG
GCAAACTATG GCGAGATGAA GCAGAAATTT CTCGAGAAGT ATCCCTTCAT CAAACACGAA
GATCTCAATC CCGACGCCAG TTCCGCGCAG GAGATCGAGG CGATCAAAGC CAACGCTGGC
AGCAAGGGCC CACAGGCGCC CGATGTAATC GATGTCGGGT TCACCTGGGG CGATACGGCA
AAGAAAGAAG GTCTCCTCCA ACCTTACAAG GTTGAGAAGT GGGATGAAAT CCCGGAGACG
CTGAAAGACC CCGAAGGGTT CTGGTATGCC GACTACTATG GCGTTATGGC GTTTGAAGTG
AACACGCAGG TGGTTCAGAA CATCCCGCAG GACTGGTCGG ACCTGTTGAA GCCGGAGTAT
AAGGGGCAGG TGGCGCTGGC AGGCGACCCG ACCGGTTCCG GGCAGGCGAT CAACGCGGTG
TGGGCTGCGG CGCTCGGCAA CGGCGGCTCG CTCGACAATC CCATGCCGGG GCTGGAGTTC
TTCAAGAAAC TGAACGAGGC GGGCAACCTG CTGCCGGTTG TCGCCAAACC TGCAACCATC
GCCAAGGGCG AAACGCCCAT CGCGCTGCGC TGGGATTATA ATGCGCTGGC AAACCGCGAT
CAGAACGCCG GGAATATCGA CATCGCCGTG GTTGTGCCGA AAAGCGGATC GCTAGCCGGC
GTCTATGTGC AGGCGATCAG CGCCTATGCG CCACGCCCGC ACGCAGCCCG GCTCTGGATG
GAGTTCCTCT ACTCCGATGA GGGTCAGTTG ATCTGGCTTA AGGGGTACGC GACGCCGGCG
CGCTTCGAGG CAATGCGCAA AGCCGGGTTG ATCCCGCAGG ATTTACTCGA CAAATTGCCG
AAGACCGATG CGCCTGTGGC GTTCCCAACC GGTGGTCAGA TCAATGCTGC CTTCGACATG
ATCAAGAGCA ACTGGCCTAC GGTCGTTGGT GCAACGGTGC AGTAG
 
Protein sequence
MSAETRDWRR SRRAFLRAAV GAGAGAILAA CGQAAPTAPA ATTAPAATTA PAAEATAPAQ 
LPSPPAPGPL TAADAGGIEK LIERARAEGN LSTIALPDDW ANYGEMKQKF LEKYPFIKHE
DLNPDASSAQ EIEAIKANAG SKGPQAPDVI DVGFTWGDTA KKEGLLQPYK VEKWDEIPET
LKDPEGFWYA DYYGVMAFEV NTQVVQNIPQ DWSDLLKPEY KGQVALAGDP TGSGQAINAV
WAAALGNGGS LDNPMPGLEF FKKLNEAGNL LPVVAKPATI AKGETPIALR WDYNALANRD
QNAGNIDIAV VVPKSGSLAG VYVQAISAYA PRPHAARLWM EFLYSDEGQL IWLKGYATPA
RFEAMRKAGL IPQDLLDKLP KTDAPVAFPT GGQINAAFDM IKSNWPTVVG ATVQ
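
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation routine. This is a minimal sketch, not the database's own method: the helper names (`clean`, `translate_cds`, `gc_percent`) are hypothetical, the codon assignments are those of translation table 11 (identical to the standard code for elongation), and the first codon is simply read as Met without checking that it is a valid table 11 start codon such as GTG.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in tens."""
    return "".join(seq.split()).upper()


def translate_cds(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Assumption: the first codon is the initiator and is read as Met,
    which is how alternative start codons such as GTG are handled.
    """
    dna = clean(seq)
    usable = len(dna) - len(dna) % 3
    protein = []
    for index in range(0, usable, 3):
        aa = CODON_TABLE[dna[index:index + 3]]
        if aa == "*":
            break
        protein.append("M" if index == 0 else aa)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence as listed in the record.
first_line = "GTGAGTGCCG AAACCAGAGA CTGGCGCCGA TCGCGCCGGG CTTTTCTGCG TGCAGCGGTT"
print(translate_cds(first_line))  # MSAETRDWRRSRRAFLRAAV
```

The printed peptide matches the first 20 residues of the protein sequence. Passing the full gene sequence to `translate_cds` and `gc_percent` would let the listed protein length and GC content be checked the same way.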