Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_2466 |
Symbol | |
ID | 5734346 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 3147967 |
End bp | 3153630 |
Gene Length | 5664 bp |
Protein Length | 1887 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641279605 |
Product | fibronectin type III domain-containing protein |
Protein accession | YP_001545232 |
Protein GI | 159898985 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.401987 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGGACT CAGCAACGCT CTCTCGGCAT CGTATAGTTC GAACGATCAC GCTGCTTATC CTCACAAGTT TTATCCTCGG TCTTTTTGCA TTTACGCTTC AATCAACGGC TGCTCAAACT ACTCCAACGG CCTCGGATAA GGCCTATGGT TGGTTGCAAG CGCAGCAGTT GCCAAATGGA TTGGTCGATA GCTTCGAGCA CGGGGGGGCT GCCGATGATC TGTGTGTGGT CTATGACCAG GCGGTGGCAG CGATCGCCTT TGTGGTCAAG GCTGATTATG ATCGTGCTCG GGCGGTGCTG ACCGCTTTGC GTGGCTCGCA ATGGGATGAT GGCAGCTGGC ACAACATTTA TGCCTGTAGC AATCCCAATC AAGTGATGGA ATGGCATCGC GATGTCGGGC CAGCGGTGTG GGTGGCCTTG GCAGTGGCGA GCTATGAAGC CGCAACCGGC GATCTGGTGA CCTATCGCGA GATGGCGTTG CGAGCGGTGA ACTTTGGCTT TGGCTTTCAG CAGGATGATG GCGGGGTGAA TGGTGGCTTT GAGGCGACCC AACAAGGCTA TCGGTTCCAC AGCTGGGGAT CGACTGAGCA TGCGATCGAC CTCTATGCGG CGGCTCGCTA TTTTTATGGC GATCAACCAC GCACTGGGCA GGTGAAGCAA TTTCTTGAAA CAGTCGTGTG GGATTCGGTC GATGGTCGTT GGTTGGGTGG TCGTGCCGAT CTGCGTGACC CATTGGATGT GAACACCTGG GGTGTGGCCG CGCTGGGCGA CGAGTATATC ATGGCGTTGG AGTATGCATT GGCAACGCAT CGGGTGACCT TAGGCGGGAT CGATGCGGTG GACTTCAACA GTGACCGCAA TGATATTTGG TTTGAGGGCA CCGGCCAATT GATTGTAGCC TTGCAGGCGG TGGGCCGCAC CAGCGACGCG CAGTATTTTC TGCAACAAAT GGCGAAAGGT CAAAAGGCCA ATGGGGGAAT TCCCTATTCG TTGCAAGGAA CGAACAACGG CTATTGGACA ATGAGCACGG CGAATGCGGT GTCGAGCGCG GCCTGGTTGA TTTTTGCTGA TGCCGCGTTC AACCCGCTCG GTGTAGGCAC AGCCACGATC ACGAGTGTTG CGCCAACGCC TGATCCGGTG GTAGTGGAGC CGCCCGTAAG TGCCTCGGAT AAGGCCTATG GCTGGTTGCG CTCACAGCAA TTGGCGAATG GCTTGGTCGA TAGCTTCGAG CACGGGGGGG CTGCCGATGA TCTGTGTGTG GTCTATGATC AAGCGGTGGC AGCGATCGCC TTTGTAGTGA AAGCCGATTA CGATCGTGCT CGTGCCGTAT TGACCGCCTT GCGTGGCTCG CAATGGGGTG ATGGCAGTTG GCACAACATT TATGCCTGTA GCAATCCCAA CCAAGTGATG GAATGGCATC GCGATGTCGG GCCAGCGGTG TGGGTGGCCT TGGCAGTGGC GAGCTATGAA GCCGCAACCG GCGATCTGGT GACCTATCGC GAGATGGCGT TGCGAGCGGT GAACTTTGGC TTTGGCTTTC AGCAAGATGA TGGCGGGGTA AATGGTGGGT TTGAGGCAAC GCAACAAGGC TATCGGTTCC ACAGCTGGGG ATCGACTGAG CATGCGATCG ACCTGTATGC GGCAGCCCGC TATTTTTATG GTGATCAACC GCGCACCGGG CAGGTGAAGC AGTTTCTTGA AACGGTGGTG TGGGAAGCCG CCAAAGGTCG TTGGCTGGGT GGCCGTGCCG ATCAGCGTGA CCCGTTGGAT GTGAACACCT GGGGCGTGGC TGCACTGGGC GACGAGTATA TCATGGCCTT GGAGTATGCA TTAGGCACGC ATCGGGTAAC CTTGGGCGGA ATCGATGCGA TGGATTTCAA CAGCGACCGC AATGATATCT GGTTTGAGGG CACTGGCCAA TTGATTGTAG CGTTGCATGC GGTGGGCCGC ACCAGCGACG CGCAGTATTT TCTGCAACAG ATGGCGAAGG GCCAAACGGC GAATGGGGGC ATCCCCTATT CGTTGCAAGG GACGAATAAT GGCTATTGGA CGATGAGTAC GGCGAATGCG GTATCGAGCG CGGCCTGGTT GATTTTTGCT GATGCCGCGT TCAATCCGCT CGGTTTAGGC ACGGCGACGA TCACGAGTGT TGCGCCAACG CCTGATCCGG TGGTGGTGGA ACCGCCCGTA AGTGCCTCGG ATAAGGCCTA CGGCTGGTTG CGTTCACAGC AATTGCCAAA TGGCTTGGTC GATAGCTTCG AGCAAGGTGG GGCTGCCGAT GATCTGTGTG TGGTGTACGA CCAAGCGGTG GCGGCAATCG CCTTTGTGGT GAAAGCCGAT TATGATCGTG CTCGGGCGGT GTTGACGGCT TTGCGAGGCT CGCAATGGGG CGATGGCAGT TGGCACAATA TTTATGCTTG TAGCAATCCC AACCAAGTGA TGGAATGGCA TCGCGATGTC GGGCCAGCGG TGTGGGTGGC CTTGGCAGTG GCGAGCTATG AAGATGCGAC GGGCGATCTG GTGACCTATC GCGAGATGGC GTTGCGAGCG GTGAACTTTG GCTTTGGCTT TCAGCAGGAT GATGGTGGAG TCAATGGTGG CTTTGAGGCA ACCCAACAAG GCTATCGGTT CCACAGTTGG GGATCGACTG AGCATGCGAT CGACCTCTAT GCGGCGGCTC GCTATTTTTA TGGCGATCAA CCGCGCATCG GGCAGGTAAA GCAGTTTCTC GAAACGGTGG TGTGGGAAGC CGCCAAAGGT CGTTGGCTGG GTGGCCGTGC CGATCAGCGT GACCCATTGG ATGTGAACAC CTGGGGCGTG GCCGCACTAG GCGACGAGTA TATCATGGCG TTGGAGTATG CATTAGGCAC GCATCGGGTA ACCTTGGGCG GAATCGATGC GGTGGATTTC AACAGCGACC GCAATGATAT CTGGTTTGAG GGCACTGGCC AATTGATTGT AGCGTTGCAT GCGGTGGGTC GCACCAGCGA TGCGGCGTAT TTTCTGCACC AAATGGCGAA AGGCCAAACG GCGAATGGAG GAATTCCCTA TTCGTTGCAA GGAACGAACA ACGGCTATTG GACGATGAGT ACGGCGAATG CGGTGTCGAG TGCCGCATGG TTGATTTTTG CCGATGCTGC CTTCAACCCG CTAGGCCTGA TTGGTGATGA TCAAGCACCA ACTGCGCCTA CCAATCTAAC AGCTTCGTTA ATCACAGCGA CAACTGCCCA GCTTGCATGG AACGCAGCGA ATGACAATAT TGGGGTTGAA GGCTATGCAG TGTATCTCAA TAATGCTGCT ACTCCCGCAA CGGAATGTCG AGCCGTTCGG ACAACTCAAT GCACGTTGAC CAGCCTCAAT CCCTTAACTA CCTACAGCAT TACGGTCAAG GCCTTTGATG CGGCAGGCAA TGTTTCGCTT GCGAGTAATT CGATTAATAT AACTACACCT GATGTTGATC GGATTGCCCC GACGGCCCCG ACCAATCTGA ACGCTACCAA TGTAACTGCT GCGAGTATCC TGCTTAGTTG GACGGCTTCG AGTGATAATG TTGGCGTGGT TGGCTACAAT GCCTATCTTG GTACGAACCC AATTGCGGTT ACCAGTTGCA ATGGCGTAAT TACAACTGTA TGTAACCTGA CCGGACTTGC GCCTGATACG CAATACAGTA TTACGGTCAA AGCCTTTGAT GCGGCGGGCA ATGCTTCGCT TGCAAGCAAT AGCATCCTTG TGACAACCTT GATTGATACG CAAGCACCAA CGCCACCGTT AAATCTTCAG GTTCTCGCTA AAACCACCAC GACGGTTGAT ATAAATTGGA CGGCTTCAAG TGATAATGTT GGTGTGATTG GCTACAATGC CTATCTTGGT ACTAACCCGA TTGCAGTTAC TAGTTGCAAT GCTGTAACGG CAACCATGTG TAGCCTGACT GGACTTACGC CCAATACTCA ATACAGCATT ACGGTCAAAG CGTTTGATGC GGTGGGTAAT GTTTCTGCTG CCAGCGATGG CTTGGTGGTT CGTACCAACC CATTACCAAC CTTTGGTAAT CCTTGGTATC TGCTTGATGG CGCTACGCAC ATCACGCCGA GCCAACTCAG TACAACCAGT GGCAATGCCG CCGCCAGCGA TTCGATTCCA GCGGCGAGTG CAACTAACAT CGATGGTCAG GCGACTCAAA CCATCAGCTA TGAGGCCGAA AATCTCACGG CTAGCTACGA TAGCAGCAAA ACCACCGAGT TTGAGCTGTT TATTGATGCA GGCACGGCGG TTGGCAATGG TACACAAGCA CGCATCGCCT ACGATTTTAC TGGCGATGGC ACGTGGGATC GGATTGAAAC CTACAACTAT TTTGCAACTG ATCCGGTGAA TAGTTGGGAA CGTTATACCC ATACTCGTGG ATTACGCTCG GCAACAGGCA GCTTTGCCAA TTTGACCAAT GGCAAGGTCA AGCTTGAGCT TTGGAATGCA ATCGGCAATA ACCCATCGCT GATTCGCACG AATGCCAGCA GCGCCGATGG TCAGCAATCA ACCTTGGTTT TGCCATTCTT GGTAACAACT GTTGATAACG AAGCTCCCAC TGCGCCAACC AATCTGCTAC GCGGTGTGAC GACTGCTAAC TCAGCAGCCT TTACTTGGCA AGCCGCAAGC GATAACGTTG GGGTCGTCGG TTATTCAGCC TACCTCAATG GTAGTCAGGT TGCCGTAGCA AATTGTCAGA TGGTCACAGG CCTCGGCTGT AATCTTACTG GCTTGAGTGC CAATACCAGT TATAGCGTGG TGGTCACGGC CTTTGATGCT GCTGGCAATC AATCCGAGGC TAGCGCAAGC ATCACATTTA CCACCTCCGA CTTGGATTCG ATTCAGCCAA CAGCGCCAAC AAATGTTGCA ATTGCCAACA TAGGCGCTAC CACGGCTACG GTCAGTTGGA ATGCAGCGAC TGATAATCTG GGGGTTGTTG GTTATTCAGT CTATCTGAAT GGGGCAGATC AAGCGGCCAG CGGGTGCAGC ATGGTTGATG CGTTGAACTG TACGCTGAGT GGCTTGAATG CTGACACTAG CTATACATTG GTGGTCACAG CCTTTGATGC TGCTGGCAAT CAATCCGAGC CTAGCGCAAC TGTTACATTC ACAACGCAGG CTAATCCATT TCGTTCGCAG TTGTATCTGC TTGATGGCGC TAGCCAAACC GCTGACGGAA TCTTGAGTGT GATTCCGGGT AATGCGGCAG ATAGTGATCC GCTTGCGAGT GCCCAAAATG GCAACTGGGA TGGGCTGCCA ACCAATGCTA TCACCTATCA AATTAACGAC CTGACGGCAA CCTATGATCC TGCGAAAACC ACTCAATTTA ATCTCTACCT TGATGCAGGC ATCGGCGTAG GCAACGCCAG CCAAGTGCGT GTTTCCTACG ATTTTACTGG CGATGGCACG TGGGATCGGA TCGAAACCTA TAACTATTTC GCGACTGATC CGGTGGTGGG CTGGGAACTA TACAATCATA CACGTGGGTT GCGCTCAGCA ACGGGCAGCT TTGCCAATTT GACCAATGGT AGACTGAAAG TCGAGCTTTG GAATGCAATT GGCAATGGTG GATCGCTTGT GCGAACGAGT GCCACCACGG CTGATGGCCA GCAATCGACG CTGACGCTGC CATTTAGCGA ATAA
|
Protein sequence | MMDSATLSRH RIVRTITLLI LTSFILGLFA FTLQSTAAQT TPTASDKAYG WLQAQQLPNG LVDSFEHGGA ADDLCVVYDQ AVAAIAFVVK ADYDRARAVL TALRGSQWDD GSWHNIYACS NPNQVMEWHR DVGPAVWVAL AVASYEAATG DLVTYREMAL RAVNFGFGFQ QDDGGVNGGF EATQQGYRFH SWGSTEHAID LYAAARYFYG DQPRTGQVKQ FLETVVWDSV DGRWLGGRAD LRDPLDVNTW GVAALGDEYI MALEYALATH RVTLGGIDAV DFNSDRNDIW FEGTGQLIVA LQAVGRTSDA QYFLQQMAKG QKANGGIPYS LQGTNNGYWT MSTANAVSSA AWLIFADAAF NPLGVGTATI TSVAPTPDPV VVEPPVSASD KAYGWLRSQQ LANGLVDSFE HGGAADDLCV VYDQAVAAIA FVVKADYDRA RAVLTALRGS QWGDGSWHNI YACSNPNQVM EWHRDVGPAV WVALAVASYE AATGDLVTYR EMALRAVNFG FGFQQDDGGV NGGFEATQQG YRFHSWGSTE HAIDLYAAAR YFYGDQPRTG QVKQFLETVV WEAAKGRWLG GRADQRDPLD VNTWGVAALG DEYIMALEYA LGTHRVTLGG IDAMDFNSDR NDIWFEGTGQ LIVALHAVGR TSDAQYFLQQ MAKGQTANGG IPYSLQGTNN GYWTMSTANA VSSAAWLIFA DAAFNPLGLG TATITSVAPT PDPVVVEPPV SASDKAYGWL RSQQLPNGLV DSFEQGGAAD DLCVVYDQAV AAIAFVVKAD YDRARAVLTA LRGSQWGDGS WHNIYACSNP NQVMEWHRDV GPAVWVALAV ASYEDATGDL VTYREMALRA VNFGFGFQQD DGGVNGGFEA TQQGYRFHSW GSTEHAIDLY AAARYFYGDQ PRIGQVKQFL ETVVWEAAKG RWLGGRADQR DPLDVNTWGV AALGDEYIMA LEYALGTHRV TLGGIDAVDF NSDRNDIWFE GTGQLIVALH AVGRTSDAAY FLHQMAKGQT ANGGIPYSLQ GTNNGYWTMS TANAVSSAAW LIFADAAFNP LGLIGDDQAP TAPTNLTASL ITATTAQLAW NAANDNIGVE GYAVYLNNAA TPATECRAVR TTQCTLTSLN PLTTYSITVK AFDAAGNVSL ASNSINITTP DVDRIAPTAP TNLNATNVTA ASILLSWTAS SDNVGVVGYN AYLGTNPIAV TSCNGVITTV CNLTGLAPDT QYSITVKAFD AAGNASLASN SILVTTLIDT QAPTPPLNLQ VLAKTTTTVD INWTASSDNV GVIGYNAYLG TNPIAVTSCN AVTATMCSLT GLTPNTQYSI TVKAFDAVGN VSAASDGLVV RTNPLPTFGN PWYLLDGATH ITPSQLSTTS GNAAASDSIP AASATNIDGQ ATQTISYEAE NLTASYDSSK TTEFELFIDA GTAVGNGTQA RIAYDFTGDG TWDRIETYNY FATDPVNSWE RYTHTRGLRS ATGSFANLTN GKVKLELWNA IGNNPSLIRT NASSADGQQS TLVLPFLVTT VDNEAPTAPT NLLRGVTTAN SAAFTWQAAS DNVGVVGYSA YLNGSQVAVA NCQMVTGLGC NLTGLSANTS YSVVVTAFDA AGNQSEASAS ITFTTSDLDS IQPTAPTNVA IANIGATTAT VSWNAATDNL GVVGYSVYLN GADQAASGCS MVDALNCTLS GLNADTSYTL VVTAFDAAGN QSEPSATVTF TTQANPFRSQ LYLLDGASQT ADGILSVIPG NAADSDPLAS AQNGNWDGLP TNAITYQIND LTATYDPAKT TQFNLYLDAG IGVGNASQVR VSYDFTGDGT WDRIETYNYF ATDPVVGWEL YNHTRGLRSA TGSFANLTNG RLKVELWNAI GNGGSLVRTS ATTADGQQST LTLPFSE
|
| |