Gene Rpal_5116 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_5116 
Symbol 
ID6412810 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp5498526 
End bp5500406 
Gene Length1881 bp 
Protein Length626 aa 
Translation table11 
GC content66% 
IMG OID642715001 
Productferrous iron transport protein B 
Protein accessionYP_001994080 
Protein GI192293475 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0370] Fe2+ transport system protein B 
TIGRFAM ID[TIGR00231] small GTP-binding protein domain
[TIGR00437] ferrous iron transporter FeoB 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCTCGG TCGATTCTCA GCCGTTCAAC CTTGCCCTGG TTGGAACGCC CAATAGCGGT 
AAGACCTCCC TGTTCAACGC GCTGACGGGC AGCCGTCAGA AGGTGGCGAA CTACCCGGGC
GTGACCGTCG AGCGCAAGAC CGGCTTGTTC ACCACGCCGG CCGGCCGCGT CGTCAGCCTG
GTCGACCTGC CGGGCACCTA TTCGCTGCGC GGCCGCAGCC CCGATGAAGA GATCACCCGC
GATGTCGTGC TCGGTCGCAA GTCGGACGAG CCGGTGCCGG ACCTGGTGCT GTGTGTCGCC
GACTCGACAA ACCTGCGGCT GACCTTCCGG CTGATGCTGG AATTGAAGTC GACCGGCCGT
CCGTTGATGC TCGTGCTGAA CATGTACGAC ATCGCCATGC GCCGCGGCGT TACGGTCGAC
GTCGAGAAAC TGTCGGCGCA GCTTGGTATT CCGGTGGTGA CGTCGATCGC GGTCCGCAAG
GGCGGCACGG CGGAGCTGCT GAAGCGGACC GATGAATTCG CCGCCCAGGC GCCAGCGCCC
GAGACCGACA CCCATTGGCG GCCGCTGTCC ACTAGCGAGC TGCGTGCCCT GCAGCGCGAG
GCCGACCGCA TCATCGGTGA ATGCGTCAGC CTGCCGGCGC GGCCTCACAC CCTGACGGCT
CAGGTCGATC GGGTGGTGCT GCATCCCGTC GCCGGCCTGC TGATCCTGGT CCTGATCCTG
TTCGTGATGT TCCAGGCGGT GTTCTCCTGG GCGCAGCCGA TCATGGAGCT GATCTCCGGT
GGCTTCGAGG CCCTCGGCGC CTTCGTCCAG GCCAACATGC CCGAAGGGCT GCTGCAGAGC
TTTCTGCAGA ACGGCGTGAT CTCCGGCGTC GGCAGCGTCC TCGTGTTCCT GCCGCAGATC
ATCATCATCT TCCTGTTCAT CCTGCTGCTG GAAGACCTCG GCTACATGGC GCGCGCCGCG
TTCCTGATGG ATCGCATCAT GGGCGGCGCC GGGCTGCACG GCCGTGCCTT CATTCCGCTG
CTGTCGAGCT TCGCCTGCGC GATCCCCGGC ATCATGGCGA CGCGCGTGAT CGATAACCGC
CGCGACCGCC TCACCACCAT CCTGATCGCG CCGCTGATGA CCTGCTCGGC GCGCATCCCG
GTCTATACGC TGATCATCTC GGCCTTCGTG CCGGCCAAGG AGGTGTTCGG CTGGATCAAT
CTGCAGGGGC TGGTGATGTT CGGCCTCTAT ACTGCGGGCA TCGTCAGCGC GCTGACGGTG
TCGGCGCTGG TCAAGTTCTT CATGTGGCGC GACTACGAGC CGGCACCGTT CATGCTCGAA
CTGCCGGACT ACAAGCTGCC GCGGCTGCAG AGTGTTGCCA TCGGCATCTA TATCCGCGCC
AAGATGTTCC TGCAGCGCGC CGGTACTACC ATCTTCTCGA TGATGGTGCT GATCTGGTTC
CTGGCCTCGT TCCCGCGCCC GCCGATCGGC GCGACCGAGC CGGCGATCGA CTACAGCCTG
GCCTCGATCA TCGGTCACGC GGTCGAGCCG GTACTGGCGC CGCTCGGCTT CAACTGGCAG
ATCGCGGTGG CGCTGATCCC CGGCATGGCG GCGCGCGAAG TCGCGGTCGC GGCGCTCGGC
ACCGTCTATG CGATTGAAGG CGGCAAGGAG GCGGCCGAGC AGATTGGCCA AGTGCTGGCC
TCGAAGTGGA GCCTGGCCAC CGCGCTGTCG CTGCTGGCCT GGTTCGTGTT CGCGCCGCAA
TGCGCCTCGA CGCTGGCGGT GATCAAGCGC GAGACCGGAA GCTGGCGCTG GATGGGCGTC
ACCTTCATCT ACATGTTCGC GCTCGCCTAC ATTGCCAGCC TGATCACCTA CAACGTCGCG
GTCGCCTTCG GCGCCGGCTA G
 
Protein sequence
MSSVDSQPFN LALVGTPNSG KTSLFNALTG SRQKVANYPG VTVERKTGLF TTPAGRVVSL 
VDLPGTYSLR GRSPDEEITR DVVLGRKSDE PVPDLVLCVA DSTNLRLTFR LMLELKSTGR
PLMLVLNMYD IAMRRGVTVD VEKLSAQLGI PVVTSIAVRK GGTAELLKRT DEFAAQAPAP
ETDTHWRPLS TSELRALQRE ADRIIGECVS LPARPHTLTA QVDRVVLHPV AGLLILVLIL
FVMFQAVFSW AQPIMELISG GFEALGAFVQ ANMPEGLLQS FLQNGVISGV GSVLVFLPQI
IIIFLFILLL EDLGYMARAA FLMDRIMGGA GLHGRAFIPL LSSFACAIPG IMATRVIDNR
RDRLTTILIA PLMTCSARIP VYTLIISAFV PAKEVFGWIN LQGLVMFGLY TAGIVSALTV
SALVKFFMWR DYEPAPFMLE LPDYKLPRLQ SVAIGIYIRA KMFLQRAGTT IFSMMVLIWF
LASFPRPPIG ATEPAIDYSL ASIIGHAVEP VLAPLGFNWQ IAVALIPGMA AREVAVAALG
TVYAIEGGKE AAEQIGQVLA SKWSLATALS LLAWFVFAPQ CASTLAVIKR ETGSWRWMGV
TFIYMFALAY IASLITYNVA VAFGAG