Gene Haur_2466 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_2466 
Symbol 
ID5734346 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp3147967 
End bp3153630 
Gene Length5664 bp 
Protein Length1887 aa 
Translation table11 
GC content54% 
IMG OID641279605 
Productfibronectin type III domain-containing protein 
Protein accessionYP_001545232 
Protein GI159898985 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.401987 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGGACT CAGCAACGCT CTCTCGGCAT CGTATAGTTC GAACGATCAC GCTGCTTATC 
CTCACAAGTT TTATCCTCGG TCTTTTTGCA TTTACGCTTC AATCAACGGC TGCTCAAACT
ACTCCAACGG CCTCGGATAA GGCCTATGGT TGGTTGCAAG CGCAGCAGTT GCCAAATGGA
TTGGTCGATA GCTTCGAGCA CGGGGGGGCT GCCGATGATC TGTGTGTGGT CTATGACCAG
GCGGTGGCAG CGATCGCCTT TGTGGTCAAG GCTGATTATG ATCGTGCTCG GGCGGTGCTG
ACCGCTTTGC GTGGCTCGCA ATGGGATGAT GGCAGCTGGC ACAACATTTA TGCCTGTAGC
AATCCCAATC AAGTGATGGA ATGGCATCGC GATGTCGGGC CAGCGGTGTG GGTGGCCTTG
GCAGTGGCGA GCTATGAAGC CGCAACCGGC GATCTGGTGA CCTATCGCGA GATGGCGTTG
CGAGCGGTGA ACTTTGGCTT TGGCTTTCAG CAGGATGATG GCGGGGTGAA TGGTGGCTTT
GAGGCGACCC AACAAGGCTA TCGGTTCCAC AGCTGGGGAT CGACTGAGCA TGCGATCGAC
CTCTATGCGG CGGCTCGCTA TTTTTATGGC GATCAACCAC GCACTGGGCA GGTGAAGCAA
TTTCTTGAAA CAGTCGTGTG GGATTCGGTC GATGGTCGTT GGTTGGGTGG TCGTGCCGAT
CTGCGTGACC CATTGGATGT GAACACCTGG GGTGTGGCCG CGCTGGGCGA CGAGTATATC
ATGGCGTTGG AGTATGCATT GGCAACGCAT CGGGTGACCT TAGGCGGGAT CGATGCGGTG
GACTTCAACA GTGACCGCAA TGATATTTGG TTTGAGGGCA CCGGCCAATT GATTGTAGCC
TTGCAGGCGG TGGGCCGCAC CAGCGACGCG CAGTATTTTC TGCAACAAAT GGCGAAAGGT
CAAAAGGCCA ATGGGGGAAT TCCCTATTCG TTGCAAGGAA CGAACAACGG CTATTGGACA
ATGAGCACGG CGAATGCGGT GTCGAGCGCG GCCTGGTTGA TTTTTGCTGA TGCCGCGTTC
AACCCGCTCG GTGTAGGCAC AGCCACGATC ACGAGTGTTG CGCCAACGCC TGATCCGGTG
GTAGTGGAGC CGCCCGTAAG TGCCTCGGAT AAGGCCTATG GCTGGTTGCG CTCACAGCAA
TTGGCGAATG GCTTGGTCGA TAGCTTCGAG CACGGGGGGG CTGCCGATGA TCTGTGTGTG
GTCTATGATC AAGCGGTGGC AGCGATCGCC TTTGTAGTGA AAGCCGATTA CGATCGTGCT
CGTGCCGTAT TGACCGCCTT GCGTGGCTCG CAATGGGGTG ATGGCAGTTG GCACAACATT
TATGCCTGTA GCAATCCCAA CCAAGTGATG GAATGGCATC GCGATGTCGG GCCAGCGGTG
TGGGTGGCCT TGGCAGTGGC GAGCTATGAA GCCGCAACCG GCGATCTGGT GACCTATCGC
GAGATGGCGT TGCGAGCGGT GAACTTTGGC TTTGGCTTTC AGCAAGATGA TGGCGGGGTA
AATGGTGGGT TTGAGGCAAC GCAACAAGGC TATCGGTTCC ACAGCTGGGG ATCGACTGAG
CATGCGATCG ACCTGTATGC GGCAGCCCGC TATTTTTATG GTGATCAACC GCGCACCGGG
CAGGTGAAGC AGTTTCTTGA AACGGTGGTG TGGGAAGCCG CCAAAGGTCG TTGGCTGGGT
GGCCGTGCCG ATCAGCGTGA CCCGTTGGAT GTGAACACCT GGGGCGTGGC TGCACTGGGC
GACGAGTATA TCATGGCCTT GGAGTATGCA TTAGGCACGC ATCGGGTAAC CTTGGGCGGA
ATCGATGCGA TGGATTTCAA CAGCGACCGC AATGATATCT GGTTTGAGGG CACTGGCCAA
TTGATTGTAG CGTTGCATGC GGTGGGCCGC ACCAGCGACG CGCAGTATTT TCTGCAACAG
ATGGCGAAGG GCCAAACGGC GAATGGGGGC ATCCCCTATT CGTTGCAAGG GACGAATAAT
GGCTATTGGA CGATGAGTAC GGCGAATGCG GTATCGAGCG CGGCCTGGTT GATTTTTGCT
GATGCCGCGT TCAATCCGCT CGGTTTAGGC ACGGCGACGA TCACGAGTGT TGCGCCAACG
CCTGATCCGG TGGTGGTGGA ACCGCCCGTA AGTGCCTCGG ATAAGGCCTA CGGCTGGTTG
CGTTCACAGC AATTGCCAAA TGGCTTGGTC GATAGCTTCG AGCAAGGTGG GGCTGCCGAT
GATCTGTGTG TGGTGTACGA CCAAGCGGTG GCGGCAATCG CCTTTGTGGT GAAAGCCGAT
TATGATCGTG CTCGGGCGGT GTTGACGGCT TTGCGAGGCT CGCAATGGGG CGATGGCAGT
TGGCACAATA TTTATGCTTG TAGCAATCCC AACCAAGTGA TGGAATGGCA TCGCGATGTC
GGGCCAGCGG TGTGGGTGGC CTTGGCAGTG GCGAGCTATG AAGATGCGAC GGGCGATCTG
GTGACCTATC GCGAGATGGC GTTGCGAGCG GTGAACTTTG GCTTTGGCTT TCAGCAGGAT
GATGGTGGAG TCAATGGTGG CTTTGAGGCA ACCCAACAAG GCTATCGGTT CCACAGTTGG
GGATCGACTG AGCATGCGAT CGACCTCTAT GCGGCGGCTC GCTATTTTTA TGGCGATCAA
CCGCGCATCG GGCAGGTAAA GCAGTTTCTC GAAACGGTGG TGTGGGAAGC CGCCAAAGGT
CGTTGGCTGG GTGGCCGTGC CGATCAGCGT GACCCATTGG ATGTGAACAC CTGGGGCGTG
GCCGCACTAG GCGACGAGTA TATCATGGCG TTGGAGTATG CATTAGGCAC GCATCGGGTA
ACCTTGGGCG GAATCGATGC GGTGGATTTC AACAGCGACC GCAATGATAT CTGGTTTGAG
GGCACTGGCC AATTGATTGT AGCGTTGCAT GCGGTGGGTC GCACCAGCGA TGCGGCGTAT
TTTCTGCACC AAATGGCGAA AGGCCAAACG GCGAATGGAG GAATTCCCTA TTCGTTGCAA
GGAACGAACA ACGGCTATTG GACGATGAGT ACGGCGAATG CGGTGTCGAG TGCCGCATGG
TTGATTTTTG CCGATGCTGC CTTCAACCCG CTAGGCCTGA TTGGTGATGA TCAAGCACCA
ACTGCGCCTA CCAATCTAAC AGCTTCGTTA ATCACAGCGA CAACTGCCCA GCTTGCATGG
AACGCAGCGA ATGACAATAT TGGGGTTGAA GGCTATGCAG TGTATCTCAA TAATGCTGCT
ACTCCCGCAA CGGAATGTCG AGCCGTTCGG ACAACTCAAT GCACGTTGAC CAGCCTCAAT
CCCTTAACTA CCTACAGCAT TACGGTCAAG GCCTTTGATG CGGCAGGCAA TGTTTCGCTT
GCGAGTAATT CGATTAATAT AACTACACCT GATGTTGATC GGATTGCCCC GACGGCCCCG
ACCAATCTGA ACGCTACCAA TGTAACTGCT GCGAGTATCC TGCTTAGTTG GACGGCTTCG
AGTGATAATG TTGGCGTGGT TGGCTACAAT GCCTATCTTG GTACGAACCC AATTGCGGTT
ACCAGTTGCA ATGGCGTAAT TACAACTGTA TGTAACCTGA CCGGACTTGC GCCTGATACG
CAATACAGTA TTACGGTCAA AGCCTTTGAT GCGGCGGGCA ATGCTTCGCT TGCAAGCAAT
AGCATCCTTG TGACAACCTT GATTGATACG CAAGCACCAA CGCCACCGTT AAATCTTCAG
GTTCTCGCTA AAACCACCAC GACGGTTGAT ATAAATTGGA CGGCTTCAAG TGATAATGTT
GGTGTGATTG GCTACAATGC CTATCTTGGT ACTAACCCGA TTGCAGTTAC TAGTTGCAAT
GCTGTAACGG CAACCATGTG TAGCCTGACT GGACTTACGC CCAATACTCA ATACAGCATT
ACGGTCAAAG CGTTTGATGC GGTGGGTAAT GTTTCTGCTG CCAGCGATGG CTTGGTGGTT
CGTACCAACC CATTACCAAC CTTTGGTAAT CCTTGGTATC TGCTTGATGG CGCTACGCAC
ATCACGCCGA GCCAACTCAG TACAACCAGT GGCAATGCCG CCGCCAGCGA TTCGATTCCA
GCGGCGAGTG CAACTAACAT CGATGGTCAG GCGACTCAAA CCATCAGCTA TGAGGCCGAA
AATCTCACGG CTAGCTACGA TAGCAGCAAA ACCACCGAGT TTGAGCTGTT TATTGATGCA
GGCACGGCGG TTGGCAATGG TACACAAGCA CGCATCGCCT ACGATTTTAC TGGCGATGGC
ACGTGGGATC GGATTGAAAC CTACAACTAT TTTGCAACTG ATCCGGTGAA TAGTTGGGAA
CGTTATACCC ATACTCGTGG ATTACGCTCG GCAACAGGCA GCTTTGCCAA TTTGACCAAT
GGCAAGGTCA AGCTTGAGCT TTGGAATGCA ATCGGCAATA ACCCATCGCT GATTCGCACG
AATGCCAGCA GCGCCGATGG TCAGCAATCA ACCTTGGTTT TGCCATTCTT GGTAACAACT
GTTGATAACG AAGCTCCCAC TGCGCCAACC AATCTGCTAC GCGGTGTGAC GACTGCTAAC
TCAGCAGCCT TTACTTGGCA AGCCGCAAGC GATAACGTTG GGGTCGTCGG TTATTCAGCC
TACCTCAATG GTAGTCAGGT TGCCGTAGCA AATTGTCAGA TGGTCACAGG CCTCGGCTGT
AATCTTACTG GCTTGAGTGC CAATACCAGT TATAGCGTGG TGGTCACGGC CTTTGATGCT
GCTGGCAATC AATCCGAGGC TAGCGCAAGC ATCACATTTA CCACCTCCGA CTTGGATTCG
ATTCAGCCAA CAGCGCCAAC AAATGTTGCA ATTGCCAACA TAGGCGCTAC CACGGCTACG
GTCAGTTGGA ATGCAGCGAC TGATAATCTG GGGGTTGTTG GTTATTCAGT CTATCTGAAT
GGGGCAGATC AAGCGGCCAG CGGGTGCAGC ATGGTTGATG CGTTGAACTG TACGCTGAGT
GGCTTGAATG CTGACACTAG CTATACATTG GTGGTCACAG CCTTTGATGC TGCTGGCAAT
CAATCCGAGC CTAGCGCAAC TGTTACATTC ACAACGCAGG CTAATCCATT TCGTTCGCAG
TTGTATCTGC TTGATGGCGC TAGCCAAACC GCTGACGGAA TCTTGAGTGT GATTCCGGGT
AATGCGGCAG ATAGTGATCC GCTTGCGAGT GCCCAAAATG GCAACTGGGA TGGGCTGCCA
ACCAATGCTA TCACCTATCA AATTAACGAC CTGACGGCAA CCTATGATCC TGCGAAAACC
ACTCAATTTA ATCTCTACCT TGATGCAGGC ATCGGCGTAG GCAACGCCAG CCAAGTGCGT
GTTTCCTACG ATTTTACTGG CGATGGCACG TGGGATCGGA TCGAAACCTA TAACTATTTC
GCGACTGATC CGGTGGTGGG CTGGGAACTA TACAATCATA CACGTGGGTT GCGCTCAGCA
ACGGGCAGCT TTGCCAATTT GACCAATGGT AGACTGAAAG TCGAGCTTTG GAATGCAATT
GGCAATGGTG GATCGCTTGT GCGAACGAGT GCCACCACGG CTGATGGCCA GCAATCGACG
CTGACGCTGC CATTTAGCGA ATAA
 
Protein sequence
MMDSATLSRH RIVRTITLLI LTSFILGLFA FTLQSTAAQT TPTASDKAYG WLQAQQLPNG 
LVDSFEHGGA ADDLCVVYDQ AVAAIAFVVK ADYDRARAVL TALRGSQWDD GSWHNIYACS
NPNQVMEWHR DVGPAVWVAL AVASYEAATG DLVTYREMAL RAVNFGFGFQ QDDGGVNGGF
EATQQGYRFH SWGSTEHAID LYAAARYFYG DQPRTGQVKQ FLETVVWDSV DGRWLGGRAD
LRDPLDVNTW GVAALGDEYI MALEYALATH RVTLGGIDAV DFNSDRNDIW FEGTGQLIVA
LQAVGRTSDA QYFLQQMAKG QKANGGIPYS LQGTNNGYWT MSTANAVSSA AWLIFADAAF
NPLGVGTATI TSVAPTPDPV VVEPPVSASD KAYGWLRSQQ LANGLVDSFE HGGAADDLCV
VYDQAVAAIA FVVKADYDRA RAVLTALRGS QWGDGSWHNI YACSNPNQVM EWHRDVGPAV
WVALAVASYE AATGDLVTYR EMALRAVNFG FGFQQDDGGV NGGFEATQQG YRFHSWGSTE
HAIDLYAAAR YFYGDQPRTG QVKQFLETVV WEAAKGRWLG GRADQRDPLD VNTWGVAALG
DEYIMALEYA LGTHRVTLGG IDAMDFNSDR NDIWFEGTGQ LIVALHAVGR TSDAQYFLQQ
MAKGQTANGG IPYSLQGTNN GYWTMSTANA VSSAAWLIFA DAAFNPLGLG TATITSVAPT
PDPVVVEPPV SASDKAYGWL RSQQLPNGLV DSFEQGGAAD DLCVVYDQAV AAIAFVVKAD
YDRARAVLTA LRGSQWGDGS WHNIYACSNP NQVMEWHRDV GPAVWVALAV ASYEDATGDL
VTYREMALRA VNFGFGFQQD DGGVNGGFEA TQQGYRFHSW GSTEHAIDLY AAARYFYGDQ
PRIGQVKQFL ETVVWEAAKG RWLGGRADQR DPLDVNTWGV AALGDEYIMA LEYALGTHRV
TLGGIDAVDF NSDRNDIWFE GTGQLIVALH AVGRTSDAAY FLHQMAKGQT ANGGIPYSLQ
GTNNGYWTMS TANAVSSAAW LIFADAAFNP LGLIGDDQAP TAPTNLTASL ITATTAQLAW
NAANDNIGVE GYAVYLNNAA TPATECRAVR TTQCTLTSLN PLTTYSITVK AFDAAGNVSL
ASNSINITTP DVDRIAPTAP TNLNATNVTA ASILLSWTAS SDNVGVVGYN AYLGTNPIAV
TSCNGVITTV CNLTGLAPDT QYSITVKAFD AAGNASLASN SILVTTLIDT QAPTPPLNLQ
VLAKTTTTVD INWTASSDNV GVIGYNAYLG TNPIAVTSCN AVTATMCSLT GLTPNTQYSI
TVKAFDAVGN VSAASDGLVV RTNPLPTFGN PWYLLDGATH ITPSQLSTTS GNAAASDSIP
AASATNIDGQ ATQTISYEAE NLTASYDSSK TTEFELFIDA GTAVGNGTQA RIAYDFTGDG
TWDRIETYNY FATDPVNSWE RYTHTRGLRS ATGSFANLTN GKVKLELWNA IGNNPSLIRT
NASSADGQQS TLVLPFLVTT VDNEAPTAPT NLLRGVTTAN SAAFTWQAAS DNVGVVGYSA
YLNGSQVAVA NCQMVTGLGC NLTGLSANTS YSVVVTAFDA AGNQSEASAS ITFTTSDLDS
IQPTAPTNVA IANIGATTAT VSWNAATDNL GVVGYSVYLN GADQAASGCS MVDALNCTLS
GLNADTSYTL VVTAFDAAGN QSEPSATVTF TTQANPFRSQ LYLLDGASQT ADGILSVIPG
NAADSDPLAS AQNGNWDGLP TNAITYQIND LTATYDPAKT TQFNLYLDAG IGVGNASQVR
VSYDFTGDGT WDRIETYNYF ATDPVVGWEL YNHTRGLRSA TGSFANLTNG RLKVELWNAI
GNGGSLVRTS ATTADGQQST LTLPFSE