Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ppha_0436 |
Symbol | |
ID | 6462211 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pelodictyon phaeoclathratiforme BU-1 |
Kingdom | Bacteria |
Replicon accession | NC_011060 |
Strand | - |
Start bp | 421183 |
End bp | 422163 |
Gene Length | 981 bp |
Protein Length | 326 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 642726723 |
Product | KpsF/GutQ family protein |
Protein accession | YP_002017379 |
Protein GI | 194335585 |
COG category | [K] Transcription [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0794] Predicted sugar phosphate isomerase involved in capsule formation [COG2524] Predicted transcriptional regulator, contains C-terminal CBS domains |
TIGRFAM ID | [TIGR00393] KpsF/GutQ family protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCATAC CATCCCAAAA ACAGGAAATT ACAGCGCTTG GAAAAATAAT TCTTGATCAG GAAGCCCGCG CCATTCATTT GATAGCCGAA CGACTTGATG AGAGTTTTTC GGCTGCCGTG GAACTTCTTG CATCATGCCA GGGAAAAATC ATCATCTCCG GTATGGGGAA ATCAGGTATC ATTGCACAAA AAATAGCCGC AACGATGTCA TCAACTGGTT CAACAGCCCT TTTTCTTCAT CCGGCAGATG CCGCTCATGG AGATCTTGGG ATTGTCGGTC ATACCGATAC CGTCATCTGT CTTTCGAAAA GCGGCACAAC CGAGGAGCTG AACTTTATCA TCCCGGCACT CCGCCAGATT GGTGCAAAAA TTATTGCGAT GACTGGCAAT CCCCGCTCTT TTCTTGCCCA GAAAGCGGAT ATCACACTTG ATACCGGCAT AGCAAAAGAG GCTTGTCCGT ATGATCTGGC ACCGACAACC TCGACAACGG CTATGCTCGC CATGGGAGAT GCCCTTGCCA TTGCGTTGAT GCAGGTGAAA AATTTCACAC AAAGAGATTT TGCGCTGACC CATCCCAAAG GATCACTCGG ACGACGGTTG ACTGTAAAAG TAAGTGACAT CATGGCAAAA GGCGATGCTG TCCCTATCGT GTCAGAGAGT GCTTCGGTGA CCGGTCTTAT CCTTGAAATG ACATCGAAGC GATATGGAGT AAGCGCTGTT ATAACTGATG ATGGTAAGCT GTGCGGCATT TTTACCGATG GTGACCTTCG TCGCCTGGTG CAGAGTGGCA GGGAGTTCCT CAACCTCAGC GCGGGTTCAG TTATGACTGC AAATCCCAAG ACTGTCACAG GCGACACCAT GGCTAAAGAG TGTCTTGACA TTCTTGAAAC CTGGCGCATT ACACAGTTGC TGGTGTGCGA CGATGAACAG CATCCTGTTG GAATGGTGCA TATTCATGAT TTAATTGTAC TGGGGTTATA G
|
Protein sequence | MSIPSQKQEI TALGKIILDQ EARAIHLIAE RLDESFSAAV ELLASCQGKI IISGMGKSGI IAQKIAATMS STGSTALFLH PADAAHGDLG IVGHTDTVIC LSKSGTTEEL NFIIPALRQI GAKIIAMTGN PRSFLAQKAD ITLDTGIAKE ACPYDLAPTT STTAMLAMGD ALAIALMQVK NFTQRDFALT HPKGSLGRRL TVKVSDIMAK GDAVPIVSES ASVTGLILEM TSKRYGVSAV ITDDGKLCGI FTDGDLRRLV QSGREFLNLS AGSVMTANPK TVTGDTMAKE CLDILETWRI TQLLVCDDEQ HPVGMVHIHD LIVLGL
|
| |