Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_1990 |
Symbol | |
ID | 5733879 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 2450884 |
End bp | 2453481 |
Gene Length | 2598 bp |
Protein Length | 865 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641279134 |
Product | tail collar domain-containing protein |
Protein accession | YP_001544761 |
Protein GI | 159898514 |
COG category | |
COG ID | |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACAAATC CATTTGCGCA AGCATTAACT CTTTATTTAC CACTTGATAG CATGGGCATC AACAACACCA GCGTCAGCGA TCTTTCGGGC AACCGTAATC ATGGCACAAT TCACGGCAAT GTGATGGTTG TGCCTGATGA TCAGGTTGGC AGTTGCGCCT GTTTTGATGG TCAAAGTTGG GTCGAGTTAG CCAATCCATT TGCCAGTGCC AGCGATTTTA CGCTGGCGTT GTGGGTGCGG CCAACCCGCT TCGATGGAGC CTACCATGGT TTTATTGGCA AGCAAGCCGC CGAAGATTTG TATCGCAAAC CAAGTATGTG GGTGATGGGT GATGGTGGTT TGCATCTTGA TAGTTACTCG TCTGATGGAA CTCGCTTTCA TTATGAATTA GCAGGCTTTT TTGCTCAGCC AAATGAGTGG GTGCATGTGG CCTGGGTTAA ATCGGGCACG GCCTATACAA TCTATCGCAA TGGGGTGGCC TTTACCGAAC GACCAGCCCC TGCCGAAGTC TATGTGCCAG CCAGTAGCTA TTGGCTTGGC AAAGTCGATA ATTTGTTTGA TGGATGCTTG GCGCATGTGC GAATGTACAA TCAGGCGCTT GATCCTGCTG CGGTGGCCGA TATTGCCGCC CATGATCGGG TCGCTCGCAT GGTGTTTCGG GCAAGTTATC CGCTTGATTT CAATTTGCTG AATTCGCAGC AAGAGCCAAA TCTCAACCCT GGCACAAACC CATTGACCTT GACCCTGACC AATGCCAGCG CCCAAAGCAT CGAATTATCG CCGCTTGATC GCAGTGCTCC AGCGCAACAG CAACACTTCA GTTTTAGTTT TCGCCCCAAT TTATTGGCAA TTAACACTGG AATTGCAATT GATCATCCGG CTTGGCAGGT TAGCACTCAG TCAATGAGCG ATGGCCGCAT GAATATCCTT GTTCGCTCGA CCGAGGCTCA AACGTTAACT CCCAATCAAA CCTTGCGCTT TGCACTGAGT GGGATTACGG TTTTGCCTCA AGATGGCAGC CACTCAACCC AAATTGAAAT GCAATATAGT AATTTACGCT ATGTTGGTGA AATAAGTCTG TTAAACGGAA GTCGTCTGCA ACGCATTAGT ATTAGTACCG ATGATAGTGC CTTGGATTTG CCGTTGCACC TGAGTTTGAG CAACGGTGCA ACAATTCTCA ACACCAACCA GCCCAACCAT TTAGTTGCCC GAATAAGTAA CACCTCAACC CATAGCACAT TGCACTTTAA TCAATCCGAG CCACAGAGCC ACTTAATTGT GCGGTTTGAT GGCAGTGCCA GTGCTGAGCC ATGGGCCTTG GCAACGCCCG ACCAAATTAA TGCGATTACG ATTGAGGTAG CGGGCTGGGA TGTGCAACGT CAACAGCAAG CTGGTCAAAC GACGTGGGTT TGTCGTCCAC TCAGCGATGT GGCTTTGGCT CCAGGCGCAG CGCTTGACTT ACAGATCAAC AATATCGTGA CGACCCACCC GCAAGGCAAT ACGACCCTGT ATGTTGTGGT GCACGAGCTG CAAGGTTTCA ACGATACCAC CTTAACGACG ACAATTACCA AAACCTCGAT GAACACTATT AGTAACGCTG GGCAAACTCA AAATACGCTT GCACTTGGCG AAAATGGCTT TATCAGCGGA GCGGGCTACA ACACCTTGGT AGCTCAAACG ACACTGAGCG GTGGTGGTCG AATTAGCTGG CGCAACCGTA AAGTTCGCTG GACGCAGCGC TTTTTGGCGA TTAGTATGGG GCATACTGGC TTTCCGGTTG GACACTTTAA TATTGCATAT CCAACTGCAC CAATTCCTGC TGCCGATTGC TACGATAACC TCGAACGACC AGTGAGTGAT GGCATCGAAT TGCGTGATTG GGAAGCGTTA TATGCGATTT ATACTCCCAG CACCAGCCCC AGTACCACCA GTTTACGAAT TGTGCATTAT GCCAAGCCTT TCAATCTTGA AGGCCGGGCA GTGCTGGTGG CCGTTTTTAA TGCTGATGAT CGCACCTTGA AGCTTGGCTC TGGCCTGACG TTGAGCCATC AAGGAACCTA TTCCAATGGC AGCCCAATTC CATGTGGCAC CATTCAAATG TGGTCGGGCA TGGAAGTGCC TGAAGGCTGG GCGATTTGTG ATGGCCGCGA AGCCAACGGC TTGCGCACCC CTGATTTGCG CAACCGCTTT ATTGTTGGAG CTGGGGCCAA TTACGATAGT GGCAACCTCA GTGTTTATGG TACGAATCAA GGTACAACTG GCGGCAGCGA TGTAGTGGCA TTAACCCTCG ATCAAATGCC GCGCCACACC CATGGCGGTT CAACCAATGC CGCAGGCGAC CATAGCCATT GGGTTGAAGG CACTGATGCC GATGGCTTAG CCAAACGTCG CCGTCACCAT TGGGGCGATA CTACCGTCGA TATGGGTTTT GGTGGTGGCC GCAACGCCGA CCCTAACGAT GAACGCTGGC GTGGCCGGGT CAATACCGAT AATGCTGGTA CCCATAGCCA CGGCCTGATG ATTGGTGAGG TTGGTGGTAG CCAAGCCCAC GAAAATCGCC CGCCATTCTA TGCGCTCGCC TTCATTATGA AAGTTTAA
|
Protein sequence | MTNPFAQALT LYLPLDSMGI NNTSVSDLSG NRNHGTIHGN VMVVPDDQVG SCACFDGQSW VELANPFASA SDFTLALWVR PTRFDGAYHG FIGKQAAEDL YRKPSMWVMG DGGLHLDSYS SDGTRFHYEL AGFFAQPNEW VHVAWVKSGT AYTIYRNGVA FTERPAPAEV YVPASSYWLG KVDNLFDGCL AHVRMYNQAL DPAAVADIAA HDRVARMVFR ASYPLDFNLL NSQQEPNLNP GTNPLTLTLT NASAQSIELS PLDRSAPAQQ QHFSFSFRPN LLAINTGIAI DHPAWQVSTQ SMSDGRMNIL VRSTEAQTLT PNQTLRFALS GITVLPQDGS HSTQIEMQYS NLRYVGEISL LNGSRLQRIS ISTDDSALDL PLHLSLSNGA TILNTNQPNH LVARISNTST HSTLHFNQSE PQSHLIVRFD GSASAEPWAL ATPDQINAIT IEVAGWDVQR QQQAGQTTWV CRPLSDVALA PGAALDLQIN NIVTTHPQGN TTLYVVVHEL QGFNDTTLTT TITKTSMNTI SNAGQTQNTL ALGENGFISG AGYNTLVAQT TLSGGGRISW RNRKVRWTQR FLAISMGHTG FPVGHFNIAY PTAPIPAADC YDNLERPVSD GIELRDWEAL YAIYTPSTSP STTSLRIVHY AKPFNLEGRA VLVAVFNADD RTLKLGSGLT LSHQGTYSNG SPIPCGTIQM WSGMEVPEGW AICDGREANG LRTPDLRNRF IVGAGANYDS GNLSVYGTNQ GTTGGSDVVA LTLDQMPRHT HGGSTNAAGD HSHWVEGTDA DGLAKRRRHH WGDTTVDMGF GGGRNADPND ERWRGRVNTD NAGTHSHGLM IGEVGGSQAH ENRPPFYALA FIMKV
|
| |