Gene Ppha_1997 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPpha_1997 
Symbol 
ID6462979 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePelodictyon phaeoclathratiforme BU-1 
KingdomBacteria 
Replicon accessionNC_011060 
Strand
Start bp2084246 
End bp2086471 
Gene Length2226 bp 
Protein Length741 aa 
Translation table11 
GC content49% 
IMG OID642728196 
ProductTonB-dependent receptor, putative 
Protein accessionYP_002018826 
Protein GI194337032 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1629] Outer membrane receptor proteins, mostly Fe transport 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAAA TAGCTCTTCT GGTTCTTCTC CTTGCTGTTT CCGAAATGGC TTCAGGCACT 
GAAATCCCTC TAAATTCTTC GGCAAACGGA GCAAGTGAGG TCAGCGCCAC TGAAATAACC
GTGACCGGGA AGAAAGGCGA CATTCTGCAA CGCGTAACCG GAAAGGAGTC GGAGCTGCTG
AATCCCTCAC AGATGTCAGT CTACAAGGCG ATCAATCTGA TGCCCTCGCT CAGCCAGCAG
AGTGTTGATC CCTATGGGCT TGCCGATATT GTCAACTATC ATGAATCATT TCGTTTCCGG
GGTGTTGAGG CAACATCCGG TGGTGTTCCG GCCACAACGG TGAACGTTGA AGCTCTTCCG
TTAACGGGAA GACCAGGTGG CGGCGCAACC ATTTACGACC TTGAGAATTT CAGCAATATC
AACATCTATA CCGGTGTCAT GCCCGCCAAT GCAGGACTGG GGCTTGCTGA TGTCGGCGGC
AAAATCAATA TGGAAATCCG TCGCCCCGAA GAGAGCTTTG GTGTGCTTCT CAAGCAGGGA
ATCGGCAGTC AGAATTTTTA CCGCACCTTT ATGCGTATTG ATTCCGGCTC GCTCCCTGGC
AAGGTCAAAA GCTTTATCTC ATTTTCTGAC AGTGCTGCGG ACAAATGGAA AGGCGAGGGT
AACAGCGAGA GAACCAACGT CATGGCAGGA GTGACAAAAG AGTTCAGCGA CAACGTCAAA
CTTGAGACAT TTGTAACCTA CAGCAAGGGA GATATCCATG CCTACAAGCC ACTGAGCTAT
GCTCAACTCT GTAACCCTGA AAGCGCCTAT ACGAATGATT ACGGCACAAA TCCTGACAGT
TATGACTATT ACGGCTACAA CCGTAACAAA TTTGAGGACT GGATGGTGAT GGCCAACCTC
GAGGTCAAAA CAGGCGAGCA TTCGAAACTC AACGTTAAAC CCTACTACTG GAGCGACAAG
GGATACTATC TCGAAACCAT TACCCTTGCC AATAGCCAGA ACCGGATCAG ACGGTGGGAC
ATCGACCACG ATCTCAAGGG AGTTCTTGCC GAGTACACAA CCAGACTCTC CGATATCGAT
CTTGATTTTG GCTACCTCTA TCATACACAG GTTCGTCCCG GTCCACCGAC TTCGTGGAAA
AACTACAAGG TCGTCAACGG CAAGCTGGTT TTTGATCAGT GGAACATTCT CTCAAACAGT
TCAAGCCATG AACTGCACTC CCCTTTTCTT GAGGCGACAT ACCGCTTCGG CGCCTATAAG
CTTGAAGGAG GGGTAAAATA TATCAACTAT ACGCTGCCAT CGATCATTAC CTGCAACACA
ACGGGGGTTG GCGACCTCAG TTACGATGCG GCTCTTGCCA GTGACCCGGC GATCAATACC
AAAGCCAGCG CCCTGGATAC AAAAAGCTTC AGCAGGCTCT TTCCCAATCT GACCCTTACA
AGAACTGTTG GCGATAATGC CACCATTCAT GCCGCATACG GGGAAAACTA TGTAACACAT
GTCGATATCT ATCCCTACTA TATCTCACAG TTCAGCAGTT TTGACAGCAA GGGCATCACC
TTTCAGCAGC TCTGGAGCGA GCGGGAGATG GAAACATCAA AAAATGTCGA ACTCGGCATG
AACGTACAAG GCAGCAACTG GAGCATTGCC CCAACCATCT ACTATGCGCT GCACAAGAAC
AAGCAGGCTG TGCTCTATGA TCCGGCGCTC AATGCAATAT ACCCGATGAA CAATGCCGAT
GCCAGAGGTT ACGGGTTTGA ACTGGAAGCC GAGTACAAGC CTGTTGATAA CCTGAGTTGT
TACGGGTCAT TCTCGTGGAA CAGATTTTCT TTCTCCCAGG AGATCAACTC CGATGCTCCG
GGAGGAGGAA TCATCAAGGT GAAGGGCGAA CAGGTTCCCG ATGCTCCCGA ATTCCTTGCA
AAAGGGATGG TCAGCTACAA AACCGGAAAC CTGACCATAT CGCCCATTGT CCGATACACC
TCTGTTCGTT ATGGCGATGT GCTGCACAAG GAAAAGATTG ACGGAACAAC ACTTTTTGAC
CTCGATCTTA CCTGGAGCAG AAAAATGCCC GGTTTTAAAC AGGTCGATTG CTCACTCTCT
TTCCTGAACA TTTTTGACAA GCAATATGTC AGCATGATCA GTACATCGGA CTACAAAACC
CTGAAAACAT CCTATCAATC CGGAGCGCCA TTTACCATGT TGGCAACGGT TGCATTCCAT
TACTGA
 
Protein sequence
MKKIALLVLL LAVSEMASGT EIPLNSSANG ASEVSATEIT VTGKKGDILQ RVTGKESELL 
NPSQMSVYKA INLMPSLSQQ SVDPYGLADI VNYHESFRFR GVEATSGGVP ATTVNVEALP
LTGRPGGGAT IYDLENFSNI NIYTGVMPAN AGLGLADVGG KINMEIRRPE ESFGVLLKQG
IGSQNFYRTF MRIDSGSLPG KVKSFISFSD SAADKWKGEG NSERTNVMAG VTKEFSDNVK
LETFVTYSKG DIHAYKPLSY AQLCNPESAY TNDYGTNPDS YDYYGYNRNK FEDWMVMANL
EVKTGEHSKL NVKPYYWSDK GYYLETITLA NSQNRIRRWD IDHDLKGVLA EYTTRLSDID
LDFGYLYHTQ VRPGPPTSWK NYKVVNGKLV FDQWNILSNS SSHELHSPFL EATYRFGAYK
LEGGVKYINY TLPSIITCNT TGVGDLSYDA ALASDPAINT KASALDTKSF SRLFPNLTLT
RTVGDNATIH AAYGENYVTH VDIYPYYISQ FSSFDSKGIT FQQLWSEREM ETSKNVELGM
NVQGSNWSIA PTIYYALHKN KQAVLYDPAL NAIYPMNNAD ARGYGFELEA EYKPVDNLSC
YGSFSWNRFS FSQEINSDAP GGGIIKVKGE QVPDAPEFLA KGMVSYKTGN LTISPIVRYT
SVRYGDVLHK EKIDGTTLFD LDLTWSRKMP GFKQVDCSLS FLNIFDKQYV SMISTSDYKT
LKTSYQSGAP FTMLATVAFH Y