Gene Rpal_2778 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_2778 
Symbol 
ID6410442 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp3020196 
End bp3021173 
Gene Length978 bp 
Protein Length325 aa 
Translation table11 
GC content62% 
IMG OID642712654 
ProductNMT1/THI5 like domain protein 
Protein accessionYP_001991762 
Protein GI192291157 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGTTCGA TCGGAAGACT CGTCGGTGCA GCCACCCTCG TGGCCTCGTT AGCGGTGTCG 
ACGGCGGTTC CCACCGCTGC GATGGCGAAG CAGGTGAAGA TCTCTCAAGC CTTCCAGTCC
ATGCTCTACC TGCCGTTCTA CGTCGCGCTG GATAAGGGCT TCTTCAAGCA GCAGGGCCTC
GACGTCGATA AGGAGACCGC CGGTTCGCCG ACCACCGCGT TGTCCGCTGT TTTGTCCGGC
AGTGCGGCGT TCTCGATCCA CGGTCCGGAG TGGACCGCGA TCGCCAATTC GAAGGGCGCC
GATGTCGGCA TCATCTGCAA CGTCGTGAAC GGCGCCGCGG TGTGGGTCGC TACGACGCCG
GACTTCAAGT ACGACACGCT TCAGGACCTG AAGGGGCAGA AGGTTGTCGC AGGCCTGATG
CCGACCACCA GCACGTCGTT GTTCATGAAG CTGCTGAAGG ACAACGGCTT GAAGCCGGAC
TCTGACGTCG ATCTGCTGCA GGTGCAGATC GGCTCCGAGC CGGGCCCGTT CCTCGGCGGA
CAGGCCAAGG TGGCGGTGCT GTACGAGCCC GGCCTGGATC AGGTGGTGGC GAAGGGCATG
AAGGTGGCGA TCGGCTTCCC GAAGGCTTAC GGTCCGTACG CGTTCTCGTC GATCACCGCG
CGGAAGAACG TCGATCCGAA GGACGCGCAG GCGGTCGTCA ACGCGATGGA GCTGGCGCTG
CGCTTCATGA AGAACAACCC GGATGAAGCG GTCTCGATCG CCCAGAAGGA GTTCCCGACT
CTTGATCCGA AGATCGTCGA GGCGGCGGTG CGGCGGATGA TCGCCGAGAA CGTCTATCCG
AGCAGCGTAC AGACGACGCA GCAGGCCTAT GAGACCGCGA TGCAGACACA GATCGCGCTC
GGCAATCTGA AGCAGGCGCC GAAGTACGAG GATTTCGTCA TCCAGGACTA CGTCAAGCCG
GCGCTCGCAC TGAAGTAA
 
Protein sequence
MRSIGRLVGA ATLVASLAVS TAVPTAAMAK QVKISQAFQS MLYLPFYVAL DKGFFKQQGL 
DVDKETAGSP TTALSAVLSG SAAFSIHGPE WTAIANSKGA DVGIICNVVN GAAVWVATTP
DFKYDTLQDL KGQKVVAGLM PTTSTSLFMK LLKDNGLKPD SDVDLLQVQI GSEPGPFLGG
QAKVAVLYEP GLDQVVAKGM KVAIGFPKAY GPYAFSSITA RKNVDPKDAQ AVVNAMELAL
RFMKNNPDEA VSIAQKEFPT LDPKIVEAAV RRMIAENVYP SSVQTTQQAY ETAMQTQIAL
GNLKQAPKYE DFVIQDYVKP ALALK