Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ping_0029 |
Symbol | |
ID | 4624959 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Psychromonas ingrahamii 37 |
Kingdom | Bacteria |
Replicon accession | NC_008709 |
Strand | + |
Start bp | 43781 |
End bp | 46711 |
Gene Length | 2931 bp |
Protein Length | 976 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 639795222 |
Product | pentapeptide repeat-containing protein |
Protein accession | YP_941503 |
Protein GI | 119943823 |
COG category | [S] Function unknown |
COG ID | [COG1357] Uncharacterized low-complexity proteins [COG5351] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.271954 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAGTA TTAAAAGCCT GCAGGCTTCA TTATTAACCA ATACTTTTAA GTATCAGCAG AAGCACTTTT TTGTTGCATC GACTTTATGG GGCTTCTATC TCGATTCGGG TGAGGCTGTC CTGGAGCAGA CATTATGGCC CATTATCGCA GCTAATCTGG GTAAAGATAG TCTGTTTGAT GAGGCGTTCA AGAAAGACGC CAGTGAATTT GTGGTGTTTG CCGATGCCTT TGCACCTCAG CTGCAACCCG CTCCTCGAGT GGATGTGTCA GTTAGTTTGG CCGAGATTAC AAAAATCTTA AATGTCTATG GTGAGCGCTA CTGGCAGGGT TTTGGGCCCT CTCCGGCAGA TCCTTTTATG CAAATGCCGC TCAGTTATGA GCATGCCTAT GGCGGACAAG CGCACCTTTA TAATAAAGAC GGAAAAGGCC TGCCCGGCAA TGATTCGAAT GAGGTTTTAC GCTTTTTAGC CAACATTGAG CACCCTGATT TTCCTGTCAT TTCGAAAAAA ATTCCGGATA ATAAAAGCGC CTATACCCCC CAGGGTTTTG CTGCTATTAA TAGCGAGTGG CCACAGAGAA AAAATTTATA TGGCACCTTT GATGATAATT ATCTGCAAAA CCATATGCCG GGCTTAGCGC CGGATATTAA TTGGGATTAT TTTCACACTG CGCCGCCGGA TCAACGCTTT GACCGTTATT TAACCGGCAA TGAACCCTTT TGCATAACCA ATATGCATGA AACCATGGCT GAAATTAAAG GGAACTTACC CGGGGTTGTC GGTCGCTGTT TTATCGAGCA GGAACTCCCC TTAAACGCGC TTGAGCAAGG AGAGTTAGCT CAGTATTCAG ATGCCCAAAA ACGTGATGAT AAATTGGTGC TTTTTAAGGA GATCCCCCTA AATCTAGACA CGGTTTATTT TTTCCCCAAT GACAATATCG GCATTGTTGT GCACCGCGGC ACGATCGAAA TTAATCATCC TCAGGCTAAA GATATTAAAA AGCTCTTAGT GGCGCATCAA AGTTTAACTG AACCGGCTAA ACCCAGCGCT TTTTATCAGG ATCAACTGCA GCTGCGCTGT GACCCTGAAG ACGGGTTTAA GTATATGATG TATTCGGCGC CTTTAATCCC TGAAGATGTG ACTTGCGGTT TTAAACAGCT TCTAGGCGAT GAAGAATATA AGCAGGTCAT GGGCGATAAT CTGAGCGCCT TTGCTGAGGG AAAAAAACAG CAGGCCGAGC AGGAAATGGA TGAGGCTATC GACAAACAAG TCGCGGAATT AAGAGCCAAT GGCATGGACA AACAAGCCGA TGAGTTATTA GATAAAATAA AAAATCCGCC TCAGGATATT GAATTGCCCG AAGATGCCAA AAAACTGCAG GCCTTAACCG ATAAAATATT ACCGGGTATA AGCGCTATGA AAGAGGCACC GAAACTTGAT GATCTTGACC TGACCAAGCT AAATCTCAAA GCCATGGATG AAGTGCAGGC ACATATGGAG GCGATGGCCG AAAAACAAAA AAAAGAAGCG TTATTAAAGG TCGAGCAGCA ACTTGACGAA TTAAAGCAGC AAGCCGCGCA GCAGCCTGAA ATGGCAGAGC AGCTCGACCC GTCAATAAAA CAGCTTGAAG AGATGTTGGC AAGTATTGAT GCTATTCCGG TCTTAACCCG TCCCGATACT GTTGAGCAAG ATACTCAGCT AAGTGCTCAG CTTGCACAAG CCGCAGAGCA GTTGACCGAG CAGAAAAAAA TGATGGCGGA GCATAATATT GAGTTAAGCG CGGAGCAGCA ACAGCAAATG ACCGAGCTCG AGCAGCTGTT AGATAATAAT GAAACGCTGT CGGCGCAAAT GGCAGAGGCC AATGACTTTA TTAATGACAG TTATTTAAAA GGGGCGCATT TTATAGCAGA GTCCAGCTCT CCCCATAAAG GTAAAGAGCC TGAGCTGGTG GCGGCGCTGT TAGATAGCTA TAATGCAAAA CAAGTGATCA CCGCGAAAGA TTTCGCCTTT TGTAAACTAA CGCAGCAGAA ATTCAAACAA ATAGATTTAA ACCATGGTTT TTTTGAATAC AGTGAATTAA CTCATATTGA ATTTGAACAG GCTGATTTAA CGGCCATTAA TTTAGCTCAT GCCAAATTAT CGAATGTGAG CTTTGATAAC TGTGCGATTG AAGGGGCAAA TTTAGGCGCG GCAGAGCTAA CCGCTTGCCA GTTTAAAAAT ATGAAGCTGA TTGAGATTAC TTTGGCGCAT TCGAGTTTAA TTGATTGTGA ATTTACCGAC TGTGAGTTTG GTGAGCGCAT GGATATGCTC CTGGAAACCA AGCTGCTCCG TTGTAAATTT GTAAATTGCA CCTTTTTGAA ACTGAACTTT ATTGAACTGG ATTTAACCGG CTGTGAGTTT ATTGATTGTG ATTTAAGCGA ATCTAATTTT ATTAAACCTA TTTTGACCGA TGCTAGCTTT ACAGGCTCGA CCTTAAATGG CACCAACTTT GTAATAGCAG AGCTTAATAA CAGCTGTTTT GAACGGGCAA GAATGAAAAA TACCCGTTTT GTTGGCGGGT GTTTATTAAA TGAGGCCTGT TTTAATTTTG CCACCATTAA TGAAACCAAT TTACGGGATT GCCAGTTAAA CAACTGTGAT TTTTCCGATG CGGATATCAG TAAAAGCGAT TTTGGTGAAT CGAGTATTAA AAACAGCCAA TTCAACTGCA CTATTGCCAG GCAAGTACAG TTTATCGACA GTAACTTGAA TGGCAGCCAG TTTAAAAAAG CGGATTTTAT GGAGGCTAAT TTAATGCAGG CGGATATTCG CGGCTGTAAT TTTTCAGGCG CGAATCTTTA TGGTGCCAGT TTTTTAAATG CCACTCTGGG CAGTACTTCA TTTTATGGCG CTATATTAGA AAACACCCTA TTAAAAGACT GGAGACCCTA A
|
Protein sequence | MKSIKSLQAS LLTNTFKYQQ KHFFVASTLW GFYLDSGEAV LEQTLWPIIA ANLGKDSLFD EAFKKDASEF VVFADAFAPQ LQPAPRVDVS VSLAEITKIL NVYGERYWQG FGPSPADPFM QMPLSYEHAY GGQAHLYNKD GKGLPGNDSN EVLRFLANIE HPDFPVISKK IPDNKSAYTP QGFAAINSEW PQRKNLYGTF DDNYLQNHMP GLAPDINWDY FHTAPPDQRF DRYLTGNEPF CITNMHETMA EIKGNLPGVV GRCFIEQELP LNALEQGELA QYSDAQKRDD KLVLFKEIPL NLDTVYFFPN DNIGIVVHRG TIEINHPQAK DIKKLLVAHQ SLTEPAKPSA FYQDQLQLRC DPEDGFKYMM YSAPLIPEDV TCGFKQLLGD EEYKQVMGDN LSAFAEGKKQ QAEQEMDEAI DKQVAELRAN GMDKQADELL DKIKNPPQDI ELPEDAKKLQ ALTDKILPGI SAMKEAPKLD DLDLTKLNLK AMDEVQAHME AMAEKQKKEA LLKVEQQLDE LKQQAAQQPE MAEQLDPSIK QLEEMLASID AIPVLTRPDT VEQDTQLSAQ LAQAAEQLTE QKKMMAEHNI ELSAEQQQQM TELEQLLDNN ETLSAQMAEA NDFINDSYLK GAHFIAESSS PHKGKEPELV AALLDSYNAK QVITAKDFAF CKLTQQKFKQ IDLNHGFFEY SELTHIEFEQ ADLTAINLAH AKLSNVSFDN CAIEGANLGA AELTACQFKN MKLIEITLAH SSLIDCEFTD CEFGERMDML LETKLLRCKF VNCTFLKLNF IELDLTGCEF IDCDLSESNF IKPILTDASF TGSTLNGTNF VIAELNNSCF ERARMKNTRF VGGCLLNEAC FNFATINETN LRDCQLNNCD FSDADISKSD FGESSIKNSQ FNCTIARQVQ FIDSNLNGSQ FKKADFMEAN LMQADIRGCN FSGANLYGAS FLNATLGSTS FYGAILENTL LKDWRP
|
| |