Gene Ppha_1220 details

Gene Information

Locus tag: Ppha_1220
Symbol:
ID: 6463301
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pelodictyon phaeoclathratiforme BU-1
Kingdom: Bacteria
Replicon accession: NC_011060
Strand:
Start bp: 1278990
End bp: 1280315
Gene Length: 1326 bp
Protein Length: 441 aa
Translation table: 11
GC content: 48%
IMG OID: 642727468
Product: pentapeptide repeat protein
Protein accession: YP_002018109
Protein GI: 194336315
COG category: [S] Function unknown
COG ID: [COG1357] Uncharacterized low-complexity proteins
TIGRFAM ID:
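
The start, end, gene length, and protein length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the last codon of the CDS is a stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 1278990
END_BP = 1280315


def gene_length(start_bp, end_bp):
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count of an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # 1326 441
```

Both values agree with the Gene Length and Protein Length fields of this record.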


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTAACCG TTGAACTACT TCTCGGATCA GTAACAGAAT GGAACGCAGC TCGAAAGGCA 
CATCAAAAGG GCAGGCCCAA TCTCAAAGGG GCGGATCTCA GCGGGGCACA GCTTAACAAG
GCAGACCTCA GTCGTACAGA CCTGGTTGGG GCAAACCTCA GAGGGGCAGA CCTCAGCGGG
GCACAGCTCA ACATGGCAGA CCTCAATAGG GCAGACCTTA ACGGGGCGCA TCTCTATAAT
GCAAACTTCG GTAAAGCAAA CCTTATCAAG ACAAATCTGA GTAAAGCAAA CCTCAGCGGT
GCAACCCTAT GGGATGCCAA TCTCAGCGGG GCAGATCTCA GCGGGGCACA GCTTATATGC
GCAATTCTCA CCAATGCAAC CCTTACTGGG GCAAACCTCA CTGAGGCATG CCTTAACTCG
GCAGACCTCA CAAGGGCAAA TCTCATTGGG GGGGACTTCA CAAGGGCAAG TTTCAGCGGA
GCAACCCTCG ATGAAGTACA GCTTGCAGGG GCAGACCTTA CTATGGCATT CCTCGGTCAG
GCAAAGCTCT ACAGGTCAGA TCTCAGCGGG GCAAATCTAT GCGGCGCAAA GCTCAATAGA
GCAACCCTTA TTGAGGCAAA TCTTAGCAAG GCAGACATGC ACGGGGTAAT CATCTGGCAT
ACAATTTTTG TAAATGTAGA CCTTAGCAAC GTCAAAGGTC TTGACACTGT TCACCATGTG
GGTCCATCTA CCGTAGGGAT TGATACTCTC TGCATATCAA AAGGGAATAT ACCCGAGGTA
TTTCTGAAAG GCTGTGGTGT ACCAGATACC TTCATTGAAT ACGCGCACTC CCTCACCAGC
AAAGCTATTG AATTCTACTC CTGCTTTATC AGCCATAGCA CTGCGGATAA AGCATTTGCA
GATCGTCTCT ATGCTGACCT GCAAGCCAAA GGTGTTCGGT GTTGGTACGC TCCGCATGAC
ATGAAGGGAG GCAAAAAAAT ACACGATCAA ATTGGTGAAG CCATACGACA ACATGAAAAG
CTGCTGTTGA TTCTCTCCGA AAGCAGCATA AACAGTGACT GGGTAAAGCA GGAGATTATA
AAAGCAAAAA AACGTGAGGA TACAGAAGGA AAGCGAGTGC TTTTCCCCAT CAGTTTGATT
GAGTTTGGCA AGATTGAAGA ATGGGAGTTC CCTGACAGCA AAGGAAGGGA TTTAGCAGAA
GAAATCAGGT TGTACTATAT CCCGTCATTT ATAGGGTGGG AAAAAGACAA CGCAGCCTAT
ACAAAAGAAT TCGGAAAGCT GTTGAACTCA TTCCAGGCAG AGAAGGTCAC TGACGGAAAA
GCCTGA
 
Protein sequence
MLTVELLLGS VTEWNAARKA HQKGRPNLKG ADLSGAQLNK ADLSRTDLVG ANLRGADLSG 
AQLNMADLNR ADLNGAHLYN ANFGKANLIK TNLSKANLSG ATLWDANLSG ADLSGAQLIC
AILTNATLTG ANLTEACLNS ADLTRANLIG GDFTRASFSG ATLDEVQLAG ADLTMAFLGQ
AKLYRSDLSG ANLCGAKLNR ATLIEANLSK ADMHGVIIWH TIFVNVDLSN VKGLDTVHHV
GPSTVGIDTL CISKGNIPEV FLKGCGVPDT FIEYAHSLTS KAIEFYSCFI SHSTADKAFA
DRLYADLQAK GVRCWYAPHD MKGGKKIHDQ IGEAIRQHEK LLLILSESSI NSDWVKQEII
KAKKREDTEG KRVLFPISLI EFGKIEEWEF PDSKGRDLAE EIRLYYIPSF IGWEKDNAAY
TKEFGKLLNS FQAEKVTDGK A
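
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of one way to do that and to recompute the GC fraction and the translation. The helper names are hypothetical. The codon table is the standard amino-acid assignment, which translation table 11 shares; alternative start codons permitted by table 11 are not handled here, which does not matter for this gene because it starts with ATG.

```python
from itertools import product

# Standard amino-acid assignments in TCAG codon order (shared by table 11).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text):
    """Remove the block spacing and line breaks used in the record."""
    return "".join(text.split()).upper()


def gc_fraction(dna):
    """Fraction of G and C bases in a DNA sequence."""
    dna = clean_sequence(dna)
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna):
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean_sequence(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, as printed in the record.
first_blocks = "ATGCTAACCG TTGAACTACT TCTCGGATCA"
print(translate(first_blocks))  # MLTVELLLGS
```

Passing the full gene sequence to `translate` and `gc_fraction` allows the result to be compared with the protein sequence and the GC content field of this record.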