Gene Rpal_4590 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_4590 
Symbol 
ID6412274 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp4947980 
End bp4949230 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content62% 
IMG OID642714470 
ProductABC transporter related 
Protein accessionYP_001993559 
Protein GI192292954 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1134] ABC-type polysaccharide/polyol phosphate transport system, ATPase component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.10294 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAGTGATT TGGCGATCTC CATCCGCAAC CTGTCCAAGG TATACAAGAT CTACTCGCGC 
CCGGCCGACA TCCTGGTGGA GATGTTGACG CGCGCTAAGC GGCATCGCGA ATTCTGGGCG
TTGCGGGACG TCTCCCTGGA CATCAAGCGG GGCGAGGTGG TCGGACTGAT CGGCCGCAAC
GGCGCCGGGA AGACCACCTT GCTGCGAGTG ATCGCAGGCA CGCTCGATGC AAATGCCGGC
GACATCCGCG TCAACGGCAA GGTGTCCGCC ATCATGGCGC TCGGCACCGG TTTCAACCTG
GACCTGACCG GGCGCGAGAA CATCCTGATC GGCGGGCTGG TCCAGGGACT GACCCACGAA
GAAGTCGACG CCAAGACCGA AGAGATCATC AGTTTCTCCG GCCTGCGCGA TTTCATCGAT
CAGCCATGCA AGACCTACTC GAGCGGCATG ATCGCCCGGC TGGCGTTTTC GGTGGCGTCG
AGCATCGAGC CCGACATCCT GATCGTCGAC GAAGCGCTCG CGACTGGCGA CATGGTGTTC
AACGTCAAGA GCTATGCCCG GATGCGGCGG ATCGCGCGCA GCGGCGCCAC CGTCCTTCTC
GTCACCCACA GCCTTCCACA GATTTACGAG CTGTGTGATC GTGCGGTGCT GATCGAAAAC
GGAGCCGTGG CGCTGGATGG AGAGCCGAGG ATCGTCGGAC AGGCCTACGA GGACCTGCTC
CACAAGGAAA TGGAAGCTGC GAATGCCGCA AACGCCGCCG CCACGCCGCC CACGGCGACG
ACCCCGCCGA TCGAACTGAA GGGGTTTGTC GTGGAAGCGA TCCAGTTTAT CGATCAGGCG
CGGCTTCCGG TCCGCGCCCT GACATCGGGA CAGGCGTATC AGCTGGTCAT TCGCGGCAAG
GCGGCGTCGG CGCTGCGATC TTTCAATCTA GGCTTCTCGA TCAGCACCAA TATGGGAACC
GCGATCTACA GCACCAGCAC CGCCGATCAC GGCGGACGTC TCGATCTCGC CGCCGGTGAG
CGGGCCGAAG CGAGGTTCGA CTTTCCCTGC GACCTTAGCG CCGGAAGCTA TTTTCTGACG
ATCGAAACCA GCGAGGGCGA CTCGCCGAGG AAAGTGTTCC CGCTCGGCGA TTCCCGGGAT
CCGTCGGTTG CGGAGATTTT CGAGGTCGCG AACGGGGAGA AATTATTCGC AGGACTGATC
GACCTTGGAA GCAAGCTGCT ACTGGACGAC AACGATTCTC AAGTGAGCTA G
 
Protein sequence
MSDLAISIRN LSKVYKIYSR PADILVEMLT RAKRHREFWA LRDVSLDIKR GEVVGLIGRN 
GAGKTTLLRV IAGTLDANAG DIRVNGKVSA IMALGTGFNL DLTGRENILI GGLVQGLTHE
EVDAKTEEII SFSGLRDFID QPCKTYSSGM IARLAFSVAS SIEPDILIVD EALATGDMVF
NVKSYARMRR IARSGATVLL VTHSLPQIYE LCDRAVLIEN GAVALDGEPR IVGQAYEDLL
HKEMEAANAA NAAATPPTAT TPPIELKGFV VEAIQFIDQA RLPVRALTSG QAYQLVIRGK
AASALRSFNL GFSISTNMGT AIYSTSTADH GGRLDLAAGE RAEARFDFPC DLSAGSYFLT
IETSEGDSPR KVFPLGDSRD PSVAEIFEVA NGEKLFAGLI DLGSKLLLDD NDSQVS