Gene Paes_0102 details


Gene Information

Locus tag: Paes_0102
Symbol:
ID: 6458572
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prosthecochloris aestuarii DSM 271
Kingdom: Bacteria
Replicon accession: NC_011059
Strand:
Start bp: 99809
End bp: 102649
Gene Length: 2841 bp
Protein Length: 946 aa
Translation table: 11
GC content: 53%
IMG OID: 642724089
Product: TonB-dependent receptor
Protein accession: YP_002014809
Protein GI: 194332949
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1629] Outer membrane receptor proteins, mostly Fe transport
TIGRFAM ID: [TIGR01782] TonB-dependent receptor
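
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: the function names are hypothetical and not part of any database API, and it assumes inclusive 1-based coordinates and a single terminal stop codon that is not counted in the protein length.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive 1-based start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one untranslated terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(99809, 102649)   # 2841
residues = protein_length_aa(length)     # 946
print(length, residues)
```

With the values from this record, the result is 2841 bp and 946 aa, matching the Gene Length and Protein Length fields.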


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.436886
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACACTGT TTCACGGAAA CAACAAAACA GATAACCTTT CAGCGCAGCA AATCCTGTAC 
CTGTACCAGA GGTTTTTTAA CCGAAACAAG CCCATCATGA AATACTCATT TCTGCTGTTA
ATCACATGCG TTTCGACGCT TATGATCAAC AGCCTGTCGA TTTTACTTCC CTCAGGAGCT
GCCGGGGCGG CAACGATCAC CGGCATCGTG GTCGACGACG CCGACGGACT GCCGCTCCCT
GCAGCAACCA TCTCTGTCAA AGGCTCTGAA GAAGGCGCCA TAACCGGTCA GAACGGTCGC
TTCCGGCTGG AAAATGTCGC CACCAGTAAC CCCGTTATTG TGGCATCCTA TCTTGGCTAC
ATGACCGAGG AATACCCTGT ACTGCTCTCC TCTGAAGGTC TGGCAAAACT CTCCATCCGG
CTCAAACCAG GCATTGTGGT CAGCCAGGAG ATCACCATCG TAGGCGAAAT GCTCAAAGGC
CAGGCCAAAG CCCTCAACCA GCAGAAAAAC AATCTCAATG TCACCAATGT GGTCGCAGCG
GACCAGATCG GCAAGTTTCC TGACTCGAAC GTAGGCGACG CACTCAAACG CATTCCGGGC
ATCAGCGTCT TCACCGATCA GGGAGAAGCT CGCTTCGGCC ATATCCGCGG CACCGAACCC
CGTTTCAACT CCGTCACCGT CAACGGAGAG CGCATTCCAT CAGCCGAAGC TGAAAACCGC
ACCATTCAGC TCGATCTCAT CCCGGCAGAC ATGGTCCAGA CTATCGAAGT GACCAAAGCC
CTCACACCCG ACATGGATGC GGATGCAATC GGCGGCTCGA TCAATCTTGT CACCAAACTC
CCCACCGAAG AGCGATTTTC TCTCACCGCG GGCGGAGGCT GGAACATGAT CGACGAAACA
GGCGGAGCCC GCTACCAGCT CGGCGGAACC TATGGCAACC GCTTCCTTGA CGGCAAACTC
GGCGTGCTCT TCAGCGTCTC ATACGACGAT AACGATTTCG GATCAGACGA CATCGAAGCC
GAATGGGATG CCGAAGAGGA CGGGATTGAG GCTCTGAAAG AGTTTCAGGT CAGACAATAC
GATGTCCGCC GCATCCGGAA AAGTTTCTCG ACCGGGCTCG ACTACCGGTT CAACGAAAAC
CACGTCCTGA AATTCAACGG GATCTACAAC TGGCGCAAGG ACTACGAAAA CCGCTACCGG
GGAAGCTACA AAGACCTCGA CGAGGACCTG GCAGAGCTGG TATGGGAAAC CAAAGGAGGA
ACGAACAACA ATGCCCGCCT CGAAGACCAG CGCATGATGT CGTTCACGCT CGGCGGTGAA
CACGATTTCG GTAAACTCGA CCTCGACTGG CAGGCAGCCT ATTCCAAAGC CTCCGAGGAG
CGCCCGAACG AACGTTATGT CTCCTTCGTT GCCGAAGACC AGCCATTCAT CACTGACATC
AGCAACCCGG AACACCCCTT GGTCACGGTG AACAGTAACG TCGCCGACGG AATTTCCGGC
AACGGATCAT GGACCCTCGA TGAACTCACC GAAGAGTATC AATACACCGA GGATATCGAT
AAAAACTTCG CACTGAACCT TGCCTACGAC CTGTCAGACT CTTTCAAACT GAAGTTTGGC
GGAAAAATTC GCGACAAGCA TAAAATGCGT GAAAACGATT TCTACGCCTA TGAACCTGCA
GCCGAAGAAG CGTTTTACAC TGCTGTCTAT GACAACCTTT CGGACAAAAC CAAAGAGGGC
TTTCTCCCCG GCGAGCAATA TGAATCCGGC AGTTTCGTGT CCAATGATTT TCTCGGAAGT
CTCGACCTGA ATTCCCAGGA CTTCGACAAA GAACGCGTAC TCGAAGAGCT GGCAGGCAAC
TTTGACGCGA AAGAACAGAT CAAAGCGGTC TACCTCATGG GAAGCTGGGA TCTGAGCCGG
AAAGCAACCA TTCTCGGCGG CGTAAGGCTC GAACACACCC GCAACGAGTA TGACGCTTAT
GAGTACAATG CCGACGAAGA TATCCTGACA AAGGTAACCG GCACCCCGTC AGACTATACC
AATGTCCTTC CCTCTATGCA TCTTCGCTAC AAGATCAACG ATATGACCAG CCTCAAGCTT
GCCTATACGC ATACGCTCTC AAGGCCGAAC TACTTCGATC TTGCACCGTA TACCCTCATC
GACGATGAAG AAAAATACAT TGGCAACCCC GATCTCGAAC CGACGATGTC GAAAAACGTC
GATCTCATGA TCGAGCACTA CCTGAGCGAT GTCGGCATCC TGTCGGCAGG CGTTTTCTAC
AAATCGGTCA GTGATTTCAT CATCACAAGA AAAACAGACG ACCCCGAGTA TGAAGACGGT
CTCTTTCAAC CACTGAACGC TGGTGACGGA ACCCTTACGG GCCTCGAAAC AGCAGCGCAG
TTTCAGTTGC CCTTCATCCC CGGGCTCGGC CTCTTCCTGA ACTACACCTA TACGCACTCA
ACGATCGACA ACTTCAACAT CGAAGGGCGC CAGAGCGATG GGCTCCCTCT TCCCGGCAGC
CCCGAGCACA CCGCAAACGC CTCCGTCGCC TATGAAAAAG GGCCGTTCAA CATCCGCCTC
TCGGCGAACT ACCACAGTGA CTTCATCGAT TCCGAGGAAG GATCGATCGG GGAAAACAAG
TGGGAAGACC GCTACTGTGA CGACGCATTC CATCTCGACC TCAACGGGGG CTACCGCCTG
AACGATATTG TCCGCCTCTA CTTCGAGGTC AGCAACCTGA CCAACGAACC GTTGCGATTC
TATCAGGGCG GGGAAAGCTA TATCGCTCAG GAAGAGTGGT ATGAACGAAG GATTCTTCTT
GGCCTCAAAG CAAACTGGTA A
 
Protein sequence
MTLFHGNNKT DNLSAQQILY LYQRFFNRNK PIMKYSFLLL ITCVSTLMIN SLSILLPSGA 
AGAATITGIV VDDADGLPLP AATISVKGSE EGAITGQNGR FRLENVATSN PVIVASYLGY
MTEEYPVLLS SEGLAKLSIR LKPGIVVSQE ITIVGEMLKG QAKALNQQKN NLNVTNVVAA
DQIGKFPDSN VGDALKRIPG ISVFTDQGEA RFGHIRGTEP RFNSVTVNGE RIPSAEAENR
TIQLDLIPAD MVQTIEVTKA LTPDMDADAI GGSINLVTKL PTEERFSLTA GGGWNMIDET
GGARYQLGGT YGNRFLDGKL GVLFSVSYDD NDFGSDDIEA EWDAEEDGIE ALKEFQVRQY
DVRRIRKSFS TGLDYRFNEN HVLKFNGIYN WRKDYENRYR GSYKDLDEDL AELVWETKGG
TNNNARLEDQ RMMSFTLGGE HDFGKLDLDW QAAYSKASEE RPNERYVSFV AEDQPFITDI
SNPEHPLVTV NSNVADGISG NGSWTLDELT EEYQYTEDID KNFALNLAYD LSDSFKLKFG
GKIRDKHKMR ENDFYAYEPA AEEAFYTAVY DNLSDKTKEG FLPGEQYESG SFVSNDFLGS
LDLNSQDFDK ERVLEELAGN FDAKEQIKAV YLMGSWDLSR KATILGGVRL EHTRNEYDAY
EYNADEDILT KVTGTPSDYT NVLPSMHLRY KINDMTSLKL AYTHTLSRPN YFDLAPYTLI
DDEEKYIGNP DLEPTMSKNV DLMIEHYLSD VGILSAGVFY KSVSDFIITR KTDDPEYEDG
LFQPLNAGDG TLTGLETAAQ FQLPFIPGLG LFLNYTYTHS TIDNFNIEGR QSDGLPLPGS
PEHTANASVA YEKGPFNIRL SANYHSDFID SEEGSIGENK WEDRYCDDAF HLDLNGGYRL
NDIVRLYFEV SNLTNEPLRF YQGGESYIAQ EEWYERRILL GLKANW
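
The protein sequence is the translation of the gene sequence under translation table 11. A minimal sketch of that translation follows, using only the Python standard library. The names `CODON_TABLE` and `translate` are hypothetical. The sketch relies on the fact that table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as start codons; since this gene starts with ATG, the plain lookup is enough here. It assumes an unambiguous A/C/G/T sequence, so any other symbol raises a `KeyError`.

```python
from itertools import product

# Codons in NCBI order (T, C, A, G at each position, last base varying fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored,
    and trailing bases that do not form a full codon are dropped.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":          # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First two 10-base groups of the gene sequence above.
print(translate("ATGACACTGT TTCACGGAAA"))  # MTLFHG
```

The output for the first two groups, MTLFHG, matches the start of the protein sequence above.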