Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_3334 |
Symbol | |
ID | 4243505 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | + |
Start bp | 5113872 |
End bp | 5115197 |
Gene Length | 1326 bp |
Protein Length | 441 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 638108319 |
Product | von Willebrand factor, type A |
Protein accession | YP_722910 |
Protein GI | 113476849 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG1240] Mg-chelatase subunit ChlD |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0867289 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTTGACG TAACCATTAC CCCCCATCGA GAATTTCTAG CGGCTGATAC CCCTGGACAA AAGCTGTTTG TCATGTTAAA ATTACGTCCA AACGCAATTG TTTCAGCAAG TCGTCCTTCA ACAACCTTCA CCTTTGTTAT TGATACCAGT GGTTCAATGT ATGATGATAG TGAAGTGGGG AGGCCGAAAA TTGATATTGT TGTTGAAGCT CTCGAACGCT TAGTTACTGA TATACAAGCA GATCCTCGCG ATCGAATTGC CCTAGTACAA TTTGATGATT CAGCATCAGT TTTGTTGCCC TTGACTGCTG CCACAGATAC TGTTACTCTC CAAAATGCCA TCTCCAAATT ACGAAGTTTT AGTGGTGGAA CAAGGATGGC TTTGGGAATA GAAAAATCCC TGAATTTATT GAAAGACTCT GTTCTCAGTA GTCGTCGCAC TCTCATTTTT ACTGATGGAC AGACAATAGA TGAAATTGAC TGTCGAGAAC TAGCGGTACA ATTTGCCCAA GCTGGAATTC CTATTACTGC TCTCGGTGTT GGTGACTACA ATGAAGACTT GTTAGTCTAT TTGAGTGATC ACACTGGGGG TCGCGTTTTT AATGTTGTGG AACAAGCCAG TAATACTGGA ACCACAGATA TAGCAATTTC TGAGCTGCCA CAGACAATTT TTCAAGAAGT ACAACAGGCT CAAGCTGAGG TCATTAATAA CCTCAAGCTT AGTGTTCGTA CTGTCAAAGG GGTTAATTTA CAAAGACTTA GCCGTGTTTA TCCAGACCGC GCTGATATTC CTGTTACTCA AGAACCTTAT CTCATCGGCA GCGCTCTCGC TAATGACGAT ACTATTTTTA TTCTTGATTT TGATATTGAT AGCAGAGCTC AATCACGGGT TCGTATTGCT CAATTAGGTT TAACTTACGA CATTCCCGGT CAGCAGCGAC GAGGAGAACT ACCCCCTCAA AATCTTGTTA TTCAGTTGGT TGCCGGAAAA GGTGGAATTG CCCAAACAAA TCCGGAAGTC ATGGGATATG TACAACAGTG TAACATTGGT CAATTAGTCG ATCATGCAGC AGCAGTAGCT GATAGCAACC CTGATGAAGC AGCAAAACTT TTGGAAACAG CAAAACGAGT AACTGTCAAA ATTGGAAATG AAGCCATGTT AAAAACTCTC AATCTTGGTA TTGAAGAAGT ACGCAAAACC CGTAAACTGT CTTCAGGAAC CCGCAAGACT GTAAAAATGG GTGCTAAGGG TAAAACTGTA AAAATGAGCG ATAGTCCTAA TGATCAACTT TCAGAAGAAC AAATCCGTAA TATGACAGGA ACCTAG
|
Protein sequence | MLDVTITPHR EFLAADTPGQ KLFVMLKLRP NAIVSASRPS TTFTFVIDTS GSMYDDSEVG RPKIDIVVEA LERLVTDIQA DPRDRIALVQ FDDSASVLLP LTAATDTVTL QNAISKLRSF SGGTRMALGI EKSLNLLKDS VLSSRRTLIF TDGQTIDEID CRELAVQFAQ AGIPITALGV GDYNEDLLVY LSDHTGGRVF NVVEQASNTG TTDIAISELP QTIFQEVQQA QAEVINNLKL SVRTVKGVNL QRLSRVYPDR ADIPVTQEPY LIGSALANDD TIFILDFDID SRAQSRVRIA QLGLTYDIPG QQRRGELPPQ NLVIQLVAGK GGIAQTNPEV MGYVQQCNIG QLVDHAAAVA DSNPDEAAKL LETAKRVTVK IGNEAMLKTL NLGIEEVRKT RKLSSGTRKT VKMGAKGKTV KMSDSPNDQL SEEQIRNMTG T
|
| |