Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_3147 |
Symbol | |
ID | 4244277 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | - |
Start bp | 4803403 |
End bp | 4804665 |
Gene Length | 1263 bp |
Protein Length | 420 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 638108157 |
Product | von Willebrand factor, type A |
Protein accession | YP_722750 |
Protein GI | 113476689 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.03337 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTTGACC TAGAACTAAC CTGGGATAGA CCAGCAAAAC TTAACTCCCA ACAGGCAAAT TACCGAGACG GCAGTCACGT CCTACGAGTC CGCATTCAGC CTAAAACAGA TGCCAATTTG CCCAGTTTAC CAATACGGAT GGCGATCGCT CTTGATACCA GTCAATCAAT GAAAGGGGAA AAACTGCAAC GCGCTAAAGA AGCTTGTCTT GCCGTCGTTT CTCATTTACG GGACCCAGAC TACTTATCTC TAGCAGGTTA CTCTACCAGA GTCACACCTT TGCTGGAATC TCTAGCAGGT GGGGGTGCTG CTGCGGGATT CGCCGAAGGG GCGATCGCTG ATTTACAAGC AAGGGGTGTT ACTCGCATCG ACTTAGCATT AGATTGGATT GAAGAAAGTC TCCTTCCAGA AAAAAGCCCA CCACTCGTGG GGGTTTTAAT TACCGATGGT CATGCTACAA ATGCTGGAGG AACGCCTTTA GATGATATGA AACCCTTTAT TGTCAAAGCC AGGAACATGA AAAGTTGTGG CATTATCCTG TGTGCTGTAG GACTAGGTGA TGCTGCTAAC TTCAACACAT CCTTTTTGAC AGACTTAAGC GATCAAGGTG GAGGAGCATT TATCTATGCT GATACTCCAG ACAAGCTATT GAGCGATCTA CAAAACAGAT TAAAAGCGGA TCAGGAAATT GCCATTGTAG ATGCCAAACT TCACTTAACA CTATTAGCTT CCGGAGTTAA AGCGACAAGC TACTGTCGCT TCCGTCCAGA ATATTTACCC CTAGAAGAAA CCCGCCCCAA TGAATTAAGC CTGGGAACAC TCCGTAGGGA TTACCCTACA GATATTTTAA TTAGTTTGGA TCTTCCCTCC ATCAGTTTTG GCGAACCCCT AGGAAGTCGC GACATAATTT CAGTAGAACT AACAGCTAGG GGATTAGAAA CCCCCATAAA AAAAACAGCC GCTATTACCT ACACCACTGC CTACAGCGAA GCACAAAAAG TTAATACAGA AGTAAATCGT GATCGGCAAT GTTGGGAAAT CAACCTTTAC AGCAAAGAGA TGATAGACAT TGGTAATAGT AATCCTAAGC GCACTGGAGA GTTGCTAACA GAAATTCAAG TTACAGCAGC CAAAGCCGGG GAACTAGATC TTGCCAGCCA AGCCGCCCAA CAACTAGACG ATTTGCAAAA GACAGGAAAT TTGAATCCAG ACAAAGCCAC GGGAATATTG AGGGATTCGC GCAATCTTGG AAAGACAGAC TAA
|
Protein sequence | MFDLELTWDR PAKLNSQQAN YRDGSHVLRV RIQPKTDANL PSLPIRMAIA LDTSQSMKGE KLQRAKEACL AVVSHLRDPD YLSLAGYSTR VTPLLESLAG GGAAAGFAEG AIADLQARGV TRIDLALDWI EESLLPEKSP PLVGVLITDG HATNAGGTPL DDMKPFIVKA RNMKSCGIIL CAVGLGDAAN FNTSFLTDLS DQGGGAFIYA DTPDKLLSDL QNRLKADQEI AIVDAKLHLT LLASGVKATS YCRFRPEYLP LEETRPNELS LGTLRRDYPT DILISLDLPS ISFGEPLGSR DIISVELTAR GLETPIKKTA AITYTTAYSE AQKVNTEVNR DRQCWEINLY SKEMIDIGNS NPKRTGELLT EIQVTAAKAG ELDLASQAAQ QLDDLQKTGN LNPDKATGIL RDSRNLGKTD
|
| |