Gene Ppha_2039 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPpha_2039 
Symbol 
ID6462865 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePelodictyon phaeoclathratiforme BU-1 
KingdomBacteria 
Replicon accessionNC_011060 
Strand
Start bp2131061 
End bp2132263 
Gene Length1203 bp 
Protein Length400 aa 
Translation table11 
GC content49% 
IMG OID642728234 
Productprotein of unknown function DUF1016 
Protein accessionYP_002018864 
Protein GI194337070 
COG category[S] Function unknown 
COG ID[COG4804] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.454734 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACTATA AACAGTTACT TGCCCTCTTC AAAGAGACCC ATCAGGAGTT ACAGCAAAGA 
GCCGCCCGCT CGGTCGATAC CTCCCTGGTG ATCCGGAACT GGCTGTTCGG GTGGTACATT
GTAGAGTTTG AGCAGGGCGG CTCAGACAGA GCGGAGTATG GCGCCAATTT GCTAAAAAAA
ATCGCGGCTC AGTTGACGAT CAAAGGCTGT TCAGAACGAA GCCTCGCGCT CTGCTGTAAG
TTCTATCTCA CCTATTCTGG AATTTTGCAG GCACTGCCTG CAAAATCTGA AAGCAGGCAG
AATGAGTTCC AAAAGATTGG GCAGACACTG CCTGACCAAT CTTTTCGTGA GCAAAGTGAA
CTACCGGAGA TTCAACAGGC ACTGCCTGTT ACATCTTTTG ATGCCATAGC CAGTGCTCCC
AAAATGGTTC AGGAACTCTC CGAAACATTG GCTGGCTGCT TTTCTCTCGG ATGGACACAT
TACGTTGCTT TGCTGACCAT ATCGAACACT GATGAGCGCC GATTCTACGA AATTGAAGCC
AGCGAAAACA GTTGGGGTGC CCGAGAGCTT GAGCGGCAGA TAGCGGCCTC GCTGTATGAG
CGGCTGGCAC TCAGTCGTGA CAAGGAGGGA ATCCGGCAGC TCTCAGAGAA GGGGCTGATT
ATTGAAAAAC CGGCGGATGT GATCAAAAGC CCCTTTGTGC TTGAGTTTCT GGATCTGGAA
GAAAAAACCG CTTATTCGGA ACATGCACTT GAAACGGCCA TTATCGACCA CCTCGAACAC
TTTCTGCTTG AACTGGGCAA AGGGTTTCTC TTTGAGGCTC GCCAGAAACG GTTCACCTTC
GATAACGACC ACTTTTATGT TGATCTGGTT TTTTATAATC GGCTCTTGCG CTGCTATGTG
CTTATTGACC TCAAGCGCGA CAAGCTGACG CATCAGGATC TTGGGCAGAT GCAGATGTAT
GTGAACTACT TTGACCGCTA TGTCAAAACG GAGGATGAAC TGCCGACCAT TGGCATTCTG
TTGTGCCATC GCAAGCATGA TGCGCTGGTT GAACTGACAC TCCCCAAGGA TTCAAATATT
TTTGCATCAA AGTATCAGCT CTATCTGCCC TCAAAAGAGG AGCTGAAAAG GCAACTGGAA
GAGGCTGCAG GCATTGGGCA CTGCGAAAAT CCCGATCAGG AGGGAATGAA CGATGTTCGA
TAA
 
Protein sequence
MNYKQLLALF KETHQELQQR AARSVDTSLV IRNWLFGWYI VEFEQGGSDR AEYGANLLKK 
IAAQLTIKGC SERSLALCCK FYLTYSGILQ ALPAKSESRQ NEFQKIGQTL PDQSFREQSE
LPEIQQALPV TSFDAIASAP KMVQELSETL AGCFSLGWTH YVALLTISNT DERRFYEIEA
SENSWGAREL ERQIAASLYE RLALSRDKEG IRQLSEKGLI IEKPADVIKS PFVLEFLDLE
EKTAYSEHAL ETAIIDHLEH FLLELGKGFL FEARQKRFTF DNDHFYVDLV FYNRLLRCYV
LIDLKRDKLT HQDLGQMQMY VNYFDRYVKT EDELPTIGIL LCHRKHDALV ELTLPKDSNI
FASKYQLYLP SKEELKRQLE EAAGIGHCEN PDQEGMNDVR