Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rpal_1624 |
Symbol | |
ID | 6409281 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | - |
Start bp | 1738722 |
End bp | 1740407 |
Gene Length | 1686 bp |
Protein Length | 561 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 642711513 |
Product | transcriptional regulator, NifA subfamily, Fis Family |
Protein accession | YP_001990628 |
Protein GI | 192290023 |
COG category | [K] Transcription [T] Signal transduction mechanisms |
COG ID | [COG3604] Transcriptional regulator containing GAF, AAA-type ATPase, and DNA binding domains |
TIGRFAM ID | [TIGR01817] Nif-specific regulatory protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.714408 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCGATGG CCCAAAATTC CCGCCGAGCT CCTGCCGCCC CCATCGCGCC CGAGCCCCCC AAACGCCTCG CGACGCTGGA CCTCAGCGGC GAACCGGAAG GTTTCGTCGA CGAATTCACC CACTGCTTCA CCGGCGAATG CCGGGTCAAT GTGTTGCAGA TCCTTTTTCG CATCAATCAG GTGCTGACGC AGAACGCTGA CCTCGCCACG CTGGTGTCGA TCATTCTCGA CGGCATGCGC CAGCAGATGC GGATGCAGCG CGGCGTGATG ATGCTGTACG ATCGCCACTC CGACGCGATC TTCATCCACG ACAGCTTCGG ACTGACCGAG GAGGAGCGCG GCCGCGGCAT CTACGCGCCC GGCGAAGGAA TCACCGGCAA GGTGGTCGAG ACCGGCAAGC CGATCATCAT CCCGCGGGTG ATCGACAGCC CGGACTTTCT CGATCGCACC CGCGCCCACA ACAAGGGGCG CAAGCAGGAC AAGCTGGCGT TCTTCTGCGT GCCGATCGTG CTCGCCCAGA AGGTGCTCGG CACCATCTGC GCCGAGCGCG TCTATATGAA TCAGCGGCTG CTGAATCAGG ATGCCGAACT GCTGGCGATG GTCGCCTCGC TGATCGCACC GGCGGTCGAG CTGTATCTGA TCGAGAACGT CGACAAGGTG CGGCTGGAGA CCGAGAACAG GCGGCTGAAA AGCGAGCTGA AGCAGCGCTT TCGCCCGGCC AACATCATCG GCAACTCCAA GCCGATGCAG GAAGTCTACG CCATGGTGCA CAAGGTGGCC TCGACCAAGG CCACCGTGCT GCTGCTCGGC GAAAGCGGCG TCGGCAAGGA GCTGGTGGCG AGCGCGCTGC ACTACAACAG CCCAGTGGCC GACGGCCCGT TCATCAAGGC GAACTGCGCT GCGCTGCCCG AAGCGCTGGC GGAGAGCGAG CTGTTCGGCC ACGAGCGCGG CGCGTTCACC AGCGCGATCG CCACTCACAA GGGTTACTTC GAACAGGCAT CTGGCGGCAC GATCTTTCTC GACGAGGTCG GCGAATTAAG TCTACCGACG CAGGCCAAGC TGCTGCGCGT ACTGCAGGAA CGGACGTTCG AGCGCGTCGG CGGCGCCAAG CCGGTCAAAG TCGATGTCCG GATCATCGCC GCCACCAACC GCAATCTCGC CGAGATGGTC GCTGAGGGCA CCTTCCGCGA AGATCTGTTC TATCGCCTCA ACGTCTTCCC GATCACCATC CCGCCGCTGC GCGATCGCGG CTCGGACGTG ATCACCCTCG CGGACCACTT CGTCACCACC TATTCCGCCG AAATCGGCAA ACCGATCAAA CGGATCTCGA CGCCTGCGAT CAACATGCTG ATGAGCTATC ACTGGCCCGG CAACGTCCGC GAGCTGGAGA ACGTGATCGA GCGATCGGTG ATCCTGGCGG AGGAAGGCGT GATCCACGGC TACGATCTGC CGCCGTCGCT GCAGACGCCG ACCGAAACCG GGACCGGCTT CAGTGGCACG CTCGAAGACC GCGTCACGGC AGTCGAATAC GAGATGATCG TCGAGGCGCT CAAAGCCTCG AATGGCAATG TCGGCCAGGC CGCCACCACG CTCGGCCTGA CGCGGCGAAT GCTCGGCCTG CGGATGGAGC GCCACGGACT GACCTACAAG ACATTCCGCA CCGCAGGGCT GCGCCCGCGG AACTGA
|
Protein sequence | MPMAQNSRRA PAAPIAPEPP KRLATLDLSG EPEGFVDEFT HCFTGECRVN VLQILFRINQ VLTQNADLAT LVSIILDGMR QQMRMQRGVM MLYDRHSDAI FIHDSFGLTE EERGRGIYAP GEGITGKVVE TGKPIIIPRV IDSPDFLDRT RAHNKGRKQD KLAFFCVPIV LAQKVLGTIC AERVYMNQRL LNQDAELLAM VASLIAPAVE LYLIENVDKV RLETENRRLK SELKQRFRPA NIIGNSKPMQ EVYAMVHKVA STKATVLLLG ESGVGKELVA SALHYNSPVA DGPFIKANCA ALPEALAESE LFGHERGAFT SAIATHKGYF EQASGGTIFL DEVGELSLPT QAKLLRVLQE RTFERVGGAK PVKVDVRIIA ATNRNLAEMV AEGTFREDLF YRLNVFPITI PPLRDRGSDV ITLADHFVTT YSAEIGKPIK RISTPAINML MSYHWPGNVR ELENVIERSV ILAEEGVIHG YDLPPSLQTP TETGTGFSGT LEDRVTAVEY EMIVEALKAS NGNVGQAATT LGLTRRMLGL RMERHGLTYK TFRTAGLRPR N
|
| |