Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_0387 |
Symbol | |
ID | 4241621 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | + |
Start bp | 600305 |
End bp | 603121 |
Gene Length | 2817 bp |
Protein Length | 938 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 638105714 |
Product | cadherin |
Protein accession | YP_720328 |
Protein GI | 113474267 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG2931] RTX toxins and related Ca2+-binding proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACTAA ACACAAAAAG TATAGTAATA ATAGATGCTA GTGTAGAGAA CTACCAGCAA TTATTAAACG GAGTCATTCC CGGAGTTAAA CCTTTTCTCC TCGGCGGCGA CACTGACGGT ATCCAACAGA TAGGGGATAT CCTCCAAAAA CATCCAGAAA CGGATACTCT TCATATTATC TCCCACGGTT CTCCTGGTTG TCTGTATCTG GGAAATAGCC AATTGAGTTT GGATACTCTC AAAGGCTATG AGCCCCAACT ACAACAATGG CAACTGGACA ACCTTCTGCT CTATGGTTGT AACGTCGCTG CCGGGGATGG GGGAGAAGAG TTTATTGATA AGTTGCATCG GTTGACGGGG GCTGAGATAG CGGCTTCTAA GTCTTTGACT GGGGCGGCAG TTAAAGGGGG GAACTGGGAG TTGGAGGTGA GGACGGGCCA AACCAAACCC TCTCTGGCAT TGCAAGTAGA AGCGATGGCT AGCTACTCTG ATACTCTTGA ACTGCAATTT GAATGGGCTA AACATATCGG CGGTCAAGGT TTGACCTATG TTTATCGCAT AACCACAGAT AGCATCGGCA ATGTTTTGGT GGCGGGTGTT TTTGATGACA ATATTGATAT CGATGGCGAC GGGAACAATG ATTTTACCTC TAATAAGGAT AGCAGCACCC GGGATGCTTA TGTAGCCAAG TTCGACAGCA ATGGCAATTG GATCTGGGCT AAACAGATCG GCGGTAGGGG TTCTGAGGAT ATTAAAACGA TAACCACAGA CAGCAGCGGC AATGTCTTGG TGGCGGGTGG TTTTGGTGGC AACATTGATA TCGACGGCGA TGGAAACAAT GATTTTTCCT CTAATAATGA TTCCTCTAAT AATGATGCTT TTGCGGCTAA GTTAGACAGC AATGGCAATT TGGTCTGGGC TAAACAGATC GGCGGTAGCG TCACTACAGG TAACTATATT TCTGGCATAA TCACAGACAG CAGTGGCAAT GCCTTGGTGG CGGGTACTTT TAAGGGCAAC ATTGACATCG ACGGCGATGG GAACAATGAT TTTACCTCTA ATAATGATGG CAGCAACGAT TCTTATGTAG CTAAGTTTGA CAGCAATGGC AATTTGGTCT GGGCTAAACG TATTGGCGGT AGCAATGGTA ACTCTTCTGG TGCCATAACC ATAGACAGCA GTGACAATGT CTTGGTGGCG GGTGGTTTTA AGGGCAACAT TGATATCGAC GATGATGGGA ACAATAATTT TACCTCTAAT GGGGAGGCGG ATAGTTATGT AGCCAAGTTC GACAGCAATG GCAATTTTGT CTGGGCTAAA CAGTTCGGCA GTATCCGTGC TGACTATATT AATAGCATAA CCACAGACAG CAACGGCAAT GTCTTGGTGG GGGGTCTTTT TAATAGCTAC ATTGACATCG ACGGCGATGG GAACAATGAT TTTACCTCAA TTAGCGACAT CGATGGTTAT GTAGCCAAGT TAGACAGCAA TGGCAATTTG GTCTGGGCTA AACAGTTCGG CAGTAGCCTT GGTGGGGTTG TTAGGAGCCT GACCACAGAC AGCAGCGGCA ATGTCTTGGT GGTGGGTAAT TTTAGTGGTG ACGTTGACAT CGACGGCGAT GGGAACAATG ATTTTACCTC TAATGGTAAT GGTAACCATA CTTTTACAGC CAAGTACGAC AGCAATGGCA ATTTGGTCTG GGTTAAACAT AATCTACACA ACGCCTACAC CATAACCACA GACAGCAGCG GCAATGTCTT GATGGTGAGT GATTTTAATG GCAACATTGA CATTGACGGC GATGGGGACA ATGATTTTAC CTCTAATGGG GACAGGAATG GTTTGGTGGT GAAGTTCTCG GACGATACTA ACTCCCCTCC CACGGATATC GCCCTCAGCA ACAACACCAT CGACGAAAAC GTAGCAGCGA ATACCCCGAT AGGCAATCTC TCCAGCACAG ACCCCGATAC TGGAGATACC TTTACCTACA GTTTAGTCTC AGGTGCCGGA GACACCGACA ACGGAACCTT CAACATCAGC GGAGACGAAC TAACAATTAA AAGTTCCCCC GACTACGAAA ACAAACCGAG TTACAGCATC AGAGTTGAAA CCACCGATGC AGCAGGAGAA ACTTATCAGA AGGAATTAAC GATTAACGTT AACGACCTTT TTGACGGTAA CAGCGGCAAC GACATCCTCC GAGGCACCGC TGCCGCTGAT TCCATCTACG GTTTAGAAGG GCGCGACAAG TTGTACGGCA AAGGCGGTAA CGATATCATA TCTGGAGGAG CAGACAGCGA TATCGTTAAA GGTGACAGCG GTGACGACCA ACTCAACGGC GATGATGGTC GGGACAGGCT TTATGGTGGT ACCGGTAACG ATATCATATC TGGAGGAGAA GACAAAGACA TCGTTAAAGG TGACAGCGGC GACGACCAAC TTAACGGTGA TGCTGGTCCA GACAGGCTTT ACGGTGGCAC CGGCAACGAC ATTTTACTCG GAGGAGACGG CAACGACTTA CTTTATGGTC AAGGCGGAGA CGACGAGCTT AATGGCGGAC CTGGCATGGA CAGACTTTAC GGCAGTGACG GTATTGATAC CTTTGTCTTA GGGGAGGGCC AGGAACGCGA TAGCATCTAC AATTTCGCGG TCGGTACAGA TAAAATCAAG TTAGAGGGCA GTCTGAGTTT TGGCCAATTG CAGATTTTCC CAAGGGGTTC TTCTAGTTTA ATTGAAGTCA CTGCTACTGG AGAAGAATTG GCTATTGTGT TGGGGGTTAA TGCGGCGGAT ATTAACGACG GCAGCGTATT TATCTAG
|
Protein sequence | MKLNTKSIVI IDASVENYQQ LLNGVIPGVK PFLLGGDTDG IQQIGDILQK HPETDTLHII SHGSPGCLYL GNSQLSLDTL KGYEPQLQQW QLDNLLLYGC NVAAGDGGEE FIDKLHRLTG AEIAASKSLT GAAVKGGNWE LEVRTGQTKP SLALQVEAMA SYSDTLELQF EWAKHIGGQG LTYVYRITTD SIGNVLVAGV FDDNIDIDGD GNNDFTSNKD SSTRDAYVAK FDSNGNWIWA KQIGGRGSED IKTITTDSSG NVLVAGGFGG NIDIDGDGNN DFSSNNDSSN NDAFAAKLDS NGNLVWAKQI GGSVTTGNYI SGIITDSSGN ALVAGTFKGN IDIDGDGNND FTSNNDGSND SYVAKFDSNG NLVWAKRIGG SNGNSSGAIT IDSSDNVLVA GGFKGNIDID DDGNNNFTSN GEADSYVAKF DSNGNFVWAK QFGSIRADYI NSITTDSNGN VLVGGLFNSY IDIDGDGNND FTSISDIDGY VAKLDSNGNL VWAKQFGSSL GGVVRSLTTD SSGNVLVVGN FSGDVDIDGD GNNDFTSNGN GNHTFTAKYD SNGNLVWVKH NLHNAYTITT DSSGNVLMVS DFNGNIDIDG DGDNDFTSNG DRNGLVVKFS DDTNSPPTDI ALSNNTIDEN VAANTPIGNL SSTDPDTGDT FTYSLVSGAG DTDNGTFNIS GDELTIKSSP DYENKPSYSI RVETTDAAGE TYQKELTINV NDLFDGNSGN DILRGTAAAD SIYGLEGRDK LYGKGGNDII SGGADSDIVK GDSGDDQLNG DDGRDRLYGG TGNDIISGGE DKDIVKGDSG DDQLNGDAGP DRLYGGTGND ILLGGDGNDL LYGQGGDDEL NGGPGMDRLY GSDGIDTFVL GEGQERDSIY NFAVGTDKIK LEGSLSFGQL QIFPRGSSSL IEVTATGEEL AIVLGVNAAD INDGSVFI
|
| |