Gene Ppha_0436 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPpha_0436 
Symbol 
ID6462211 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePelodictyon phaeoclathratiforme BU-1 
KingdomBacteria 
Replicon accessionNC_011060 
Strand
Start bp421183 
End bp422163 
Gene Length981 bp 
Protein Length326 aa 
Translation table11 
GC content48% 
IMG OID642726723 
ProductKpsF/GutQ family protein 
Protein accessionYP_002017379 
Protein GI194335585 
COG category[K] Transcription
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0794] Predicted sugar phosphate isomerase involved in capsule formation
[COG2524] Predicted transcriptional regulator, contains C-terminal CBS domains 
TIGRFAM ID[TIGR00393] KpsF/GutQ family protein 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCATAC CATCCCAAAA ACAGGAAATT ACAGCGCTTG GAAAAATAAT TCTTGATCAG 
GAAGCCCGCG CCATTCATTT GATAGCCGAA CGACTTGATG AGAGTTTTTC GGCTGCCGTG
GAACTTCTTG CATCATGCCA GGGAAAAATC ATCATCTCCG GTATGGGGAA ATCAGGTATC
ATTGCACAAA AAATAGCCGC AACGATGTCA TCAACTGGTT CAACAGCCCT TTTTCTTCAT
CCGGCAGATG CCGCTCATGG AGATCTTGGG ATTGTCGGTC ATACCGATAC CGTCATCTGT
CTTTCGAAAA GCGGCACAAC CGAGGAGCTG AACTTTATCA TCCCGGCACT CCGCCAGATT
GGTGCAAAAA TTATTGCGAT GACTGGCAAT CCCCGCTCTT TTCTTGCCCA GAAAGCGGAT
ATCACACTTG ATACCGGCAT AGCAAAAGAG GCTTGTCCGT ATGATCTGGC ACCGACAACC
TCGACAACGG CTATGCTCGC CATGGGAGAT GCCCTTGCCA TTGCGTTGAT GCAGGTGAAA
AATTTCACAC AAAGAGATTT TGCGCTGACC CATCCCAAAG GATCACTCGG ACGACGGTTG
ACTGTAAAAG TAAGTGACAT CATGGCAAAA GGCGATGCTG TCCCTATCGT GTCAGAGAGT
GCTTCGGTGA CCGGTCTTAT CCTTGAAATG ACATCGAAGC GATATGGAGT AAGCGCTGTT
ATAACTGATG ATGGTAAGCT GTGCGGCATT TTTACCGATG GTGACCTTCG TCGCCTGGTG
CAGAGTGGCA GGGAGTTCCT CAACCTCAGC GCGGGTTCAG TTATGACTGC AAATCCCAAG
ACTGTCACAG GCGACACCAT GGCTAAAGAG TGTCTTGACA TTCTTGAAAC CTGGCGCATT
ACACAGTTGC TGGTGTGCGA CGATGAACAG CATCCTGTTG GAATGGTGCA TATTCATGAT
TTAATTGTAC TGGGGTTATA G
 
Protein sequence
MSIPSQKQEI TALGKIILDQ EARAIHLIAE RLDESFSAAV ELLASCQGKI IISGMGKSGI 
IAQKIAATMS STGSTALFLH PADAAHGDLG IVGHTDTVIC LSKSGTTEEL NFIIPALRQI
GAKIIAMTGN PRSFLAQKAD ITLDTGIAKE ACPYDLAPTT STTAMLAMGD ALAIALMQVK
NFTQRDFALT HPKGSLGRRL TVKVSDIMAK GDAVPIVSES ASVTGLILEM TSKRYGVSAV
ITDDGKLCGI FTDGDLRRLV QSGREFLNLS AGSVMTANPK TVTGDTMAKE CLDILETWRI
TQLLVCDDEQ HPVGMVHIHD LIVLGL