Gene Rpal_1624 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_1624 
Symbol 
ID6409281 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp1738722 
End bp1740407 
Gene Length1686 bp 
Protein Length561 aa 
Translation table11 
GC content64% 
IMG OID642711513 
Producttranscriptional regulator, NifA subfamily, Fis Family 
Protein accessionYP_001990628 
Protein GI192290023 
COG category[K] Transcription
[T] Signal transduction mechanisms 
COG ID[COG3604] Transcriptional regulator containing GAF, AAA-type ATPase, and DNA binding domains 
TIGRFAM ID[TIGR01817] Nif-specific regulatory protein 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.714408 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCGATGG CCCAAAATTC CCGCCGAGCT CCTGCCGCCC CCATCGCGCC CGAGCCCCCC 
AAACGCCTCG CGACGCTGGA CCTCAGCGGC GAACCGGAAG GTTTCGTCGA CGAATTCACC
CACTGCTTCA CCGGCGAATG CCGGGTCAAT GTGTTGCAGA TCCTTTTTCG CATCAATCAG
GTGCTGACGC AGAACGCTGA CCTCGCCACG CTGGTGTCGA TCATTCTCGA CGGCATGCGC
CAGCAGATGC GGATGCAGCG CGGCGTGATG ATGCTGTACG ATCGCCACTC CGACGCGATC
TTCATCCACG ACAGCTTCGG ACTGACCGAG GAGGAGCGCG GCCGCGGCAT CTACGCGCCC
GGCGAAGGAA TCACCGGCAA GGTGGTCGAG ACCGGCAAGC CGATCATCAT CCCGCGGGTG
ATCGACAGCC CGGACTTTCT CGATCGCACC CGCGCCCACA ACAAGGGGCG CAAGCAGGAC
AAGCTGGCGT TCTTCTGCGT GCCGATCGTG CTCGCCCAGA AGGTGCTCGG CACCATCTGC
GCCGAGCGCG TCTATATGAA TCAGCGGCTG CTGAATCAGG ATGCCGAACT GCTGGCGATG
GTCGCCTCGC TGATCGCACC GGCGGTCGAG CTGTATCTGA TCGAGAACGT CGACAAGGTG
CGGCTGGAGA CCGAGAACAG GCGGCTGAAA AGCGAGCTGA AGCAGCGCTT TCGCCCGGCC
AACATCATCG GCAACTCCAA GCCGATGCAG GAAGTCTACG CCATGGTGCA CAAGGTGGCC
TCGACCAAGG CCACCGTGCT GCTGCTCGGC GAAAGCGGCG TCGGCAAGGA GCTGGTGGCG
AGCGCGCTGC ACTACAACAG CCCAGTGGCC GACGGCCCGT TCATCAAGGC GAACTGCGCT
GCGCTGCCCG AAGCGCTGGC GGAGAGCGAG CTGTTCGGCC ACGAGCGCGG CGCGTTCACC
AGCGCGATCG CCACTCACAA GGGTTACTTC GAACAGGCAT CTGGCGGCAC GATCTTTCTC
GACGAGGTCG GCGAATTAAG TCTACCGACG CAGGCCAAGC TGCTGCGCGT ACTGCAGGAA
CGGACGTTCG AGCGCGTCGG CGGCGCCAAG CCGGTCAAAG TCGATGTCCG GATCATCGCC
GCCACCAACC GCAATCTCGC CGAGATGGTC GCTGAGGGCA CCTTCCGCGA AGATCTGTTC
TATCGCCTCA ACGTCTTCCC GATCACCATC CCGCCGCTGC GCGATCGCGG CTCGGACGTG
ATCACCCTCG CGGACCACTT CGTCACCACC TATTCCGCCG AAATCGGCAA ACCGATCAAA
CGGATCTCGA CGCCTGCGAT CAACATGCTG ATGAGCTATC ACTGGCCCGG CAACGTCCGC
GAGCTGGAGA ACGTGATCGA GCGATCGGTG ATCCTGGCGG AGGAAGGCGT GATCCACGGC
TACGATCTGC CGCCGTCGCT GCAGACGCCG ACCGAAACCG GGACCGGCTT CAGTGGCACG
CTCGAAGACC GCGTCACGGC AGTCGAATAC GAGATGATCG TCGAGGCGCT CAAAGCCTCG
AATGGCAATG TCGGCCAGGC CGCCACCACG CTCGGCCTGA CGCGGCGAAT GCTCGGCCTG
CGGATGGAGC GCCACGGACT GACCTACAAG ACATTCCGCA CCGCAGGGCT GCGCCCGCGG
AACTGA
 
Protein sequence
MPMAQNSRRA PAAPIAPEPP KRLATLDLSG EPEGFVDEFT HCFTGECRVN VLQILFRINQ 
VLTQNADLAT LVSIILDGMR QQMRMQRGVM MLYDRHSDAI FIHDSFGLTE EERGRGIYAP
GEGITGKVVE TGKPIIIPRV IDSPDFLDRT RAHNKGRKQD KLAFFCVPIV LAQKVLGTIC
AERVYMNQRL LNQDAELLAM VASLIAPAVE LYLIENVDKV RLETENRRLK SELKQRFRPA
NIIGNSKPMQ EVYAMVHKVA STKATVLLLG ESGVGKELVA SALHYNSPVA DGPFIKANCA
ALPEALAESE LFGHERGAFT SAIATHKGYF EQASGGTIFL DEVGELSLPT QAKLLRVLQE
RTFERVGGAK PVKVDVRIIA ATNRNLAEMV AEGTFREDLF YRLNVFPITI PPLRDRGSDV
ITLADHFVTT YSAEIGKPIK RISTPAINML MSYHWPGNVR ELENVIERSV ILAEEGVIHG
YDLPPSLQTP TETGTGFSGT LEDRVTAVEY EMIVEALKAS NGNVGQAATT LGLTRRMLGL
RMERHGLTYK TFRTAGLRPR N