Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_3411 |
Symbol | |
ID | 4244448 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | + |
Start bp | 5221535 |
End bp | 5222911 |
Gene Length | 1377 bp |
Protein Length | 458 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 638108394 |
Product | Xaa-Pro dipeptidase |
Protein accession | YP_722984 |
Protein GI | 113476923 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0006] Xaa-Pro aminopeptidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.141786 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAAATTC CAACAAATCA TACTTATCTA TCTAAAACAC TCCGCAACCG ACGAGAAAAA TTAGCAAAAT TAATTGATTT TCCAACAATA CTTTGGTCAG GTAGTTCTAG TTCCCGCAAC TTTCCAGCTA ATACTTTTCC CTTCCGTCCT AGCAGTCATT TTCTTTATTT TGCTGGGCTA CCTATTGAAG ATGCTGCTAT CCGTTTAGAA GGAGGAAAAT TAGAACTATT TATGGATAAT CTATCCCCAA GTAATTTACT TTGGCATGGA GAAATACCAA CACGCGATCG CCTCGCCGAA ATTATAGGTG CTGATGCTGC TTTTCCGATT AAAGAATTAA AAGATTATGC CGCAAATGCA GCAACTATTT ATGTACAAAA TCCCACTACT AAAATCACAC AATGTCAGAT TTTAAATCGC GATATTTTTC CTTCTAAGAA ACACCAAAAA ATTGACTTAG AATTAACAAA AGCAATTATC TCTTTAAGAC TTAGCCATGA TGATATGGCT TTAACAGAAA TTAAACAAGC AGCAGCAGTA ACAGTAAAAG CTCATAAAGC AGGAATGGCA GCAACAAAAA ATGCTAAATT TGAAGCTAAT ATCCGTGCTG CAATGGAAAG TATTATTATT TCTCATAATA TGACCTGTGC TTATAACAGT ATTGTAACTG TACATGGGGA AGTTTTACAC AATGGAGAAT ATTATCATCC TCTACAAACA GGAGATTTAC TATTAGCAGA TGTGGGAGCA GAAACAACTT TAGGTTGGGC AAGTGATGTG ACTCGTACTT GGCCTATTTC TGGTAAGTTT TCTCCTACAC AAAGAGATAT TTATGATGTA GTTTTAGCTG CCCATGATAA TTGTATTGCT CAACTTAAAC CAGGTGTAGA ATATTTAGAT ATTCATTTAT TAGCAGCTAA AACTATCGCC GAAGGATTAG TTAATTTAGG AATCTTAAAA GGTCAACCAG AACAGTTAGT AGAAATGGAT GCTCATGCAT TATTTTTTCC CCACGGAGTT GGTCATTTAT TAGGTTTAGA TGTACACGAT ATGGAAGATT TAGGAGATTT AGCAGGATAT GAAATAGGTC GGGAACGTAG TAGTCGTTTT GGTTTAAGTT TTTTGCGATT AAATCGTCCG TTAGCTTCTG GAATGTTAGT CACAATTGAG CCTGGTTTTT ATCAAGTTCC AGCGATTTTA AATAACACAG AAACACGTCA AAAATATCAG CATATTGTCA ATTGGGAAAA GCTCAAACAT TTTTCTGATG TTCGAGGTAT TAGGATTGAA GATGATGTTT TAGTTACCAC AAAAGGTGCT GAAATTTTGA CTAAAGAATT ACCAAGCAAT ACAGATGTAA TTGAAAGTTT ACTGTAA
|
Protein sequence | MQIPTNHTYL SKTLRNRREK LAKLIDFPTI LWSGSSSSRN FPANTFPFRP SSHFLYFAGL PIEDAAIRLE GGKLELFMDN LSPSNLLWHG EIPTRDRLAE IIGADAAFPI KELKDYAANA ATIYVQNPTT KITQCQILNR DIFPSKKHQK IDLELTKAII SLRLSHDDMA LTEIKQAAAV TVKAHKAGMA ATKNAKFEAN IRAAMESIII SHNMTCAYNS IVTVHGEVLH NGEYYHPLQT GDLLLADVGA ETTLGWASDV TRTWPISGKF SPTQRDIYDV VLAAHDNCIA QLKPGVEYLD IHLLAAKTIA EGLVNLGILK GQPEQLVEMD AHALFFPHGV GHLLGLDVHD MEDLGDLAGY EIGRERSSRF GLSFLRLNRP LASGMLVTIE PGFYQVPAIL NNTETRQKYQ HIVNWEKLKH FSDVRGIRIE DDVLVTTKGA EILTKELPSN TDVIESLL
|
| |