Gene Information |
Locus tag | Haur_2507 |
Symbol | |
ID | 5734388 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 3202591 |
End bp | 3204132 |
Gene Length | 1542 bp |
Protein Length | 513 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641279647 |
Product | type II secretion system protein E |
Protein accession | YP_001545273 |
Protein GI | 159899026 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG4962] Flp pilus assembly protein, ATPase CpaF |
TIGRFAM ID | |
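The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is a minimal illustration, assuming the Start bp and End bp values are 1-based and inclusive and that the CDS ends in a single stop codon; the function names `span_length` and `expected_protein_length` are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information fields above.
start_bp = 3202591
end_bp = 3204132
gene_length_bp = 1542
protein_length_aa = 513


def span_length(start, end):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Number of codons in an unspliced CDS, minus one for the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_gene_length = span_length(start_bp, end_bp)
computed_protein_length = expected_protein_length(computed_gene_length)

# Both comparisons hold for this record.
print(computed_gene_length == gene_length_bp)
print(computed_protein_length == protein_length_aa)
```

Under these assumptions, the listed Gene Length and Protein Length agree with the Start bp and End bp fields.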
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.000823712 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCACTTC TCAAACGTCT TGGAAGCACG CCAACAAGCC CAGAGCCAAC TCCTCCTGCT CAGCCAGTTC AACCGATGGC TGGCGAACAA TCTATGCCTG AATATCGCTC ATCATCGTTG ACTCCACTAA GCCCACCACC ACCAGCACCA TCATCGCCAC CCATGCCTGG CGGTAACCTC AGTGCTTTGG CTGGCTCTAG CTCAGGGTTA AGTGGCGGTA CGATCAACAC CAGCGGCTTT TTGGTCTTGG CTGATCGCGA TGCCCATCGC AAAATGCTGG AGCTTTCGCT CTGGATTGTC GATAAAATTC AGGCTTCGGT TGGTTCGCAA ACCCAACTAC AACGCAACGA AGATTCGGAG CGCCTCATCC AAGAGCGCTT TACCACCTTT GTGCGTCAAT CCACTACCAA TCTCGACCAA GATGGCACCA AATTGCTCTA TCAAATGGTG CTCGATGAAT TATTTGGCTT CGGGCCACTT GAGCAATTGA TTCGCGACGA TACCATCACT GAAATTATGG TCAACAGCGC CACGGTGGTG TATGTTGAAC AAAAAGGTAA ATTAACCCTT TCGCCAGTCA TTTTTGCCTC GGAAGATCAT GTGCTCAAAG TGATCGACCG AATTATTCGG CCACTCGGGC GACGGGTTGA CCGCAAATGG CCCATGGTCG ATGCGCGTTT GCCCGATGGT TCGCGGGTCA ATGCCATCAT CCCGCCTTGT GCAATCGATG GTTCGTCGCT GAGTATTCGT AAATTCTCCA AGAAAAAACT CCAAGTTAGC GATTTGATCA ATTATGGCTC GATGACCAAG GAAATGGCCG ACTTCCTGAA TGCTTGTGTG GTCAGCGCCA TGAACTTAAT CGTCTCGGGT GGTACTGGTT CGGGTAAAAC CACACTCCTA AACGTGCTCT CCAACTTCAT CCCCGACCAC TATCGGATTT GTACAATTGA AGACTCAGCC GAACTACAAC TTGGTAAAGA TCACGTGATT CGGCTTGAAT CCAAGCCTGC CGATGTTGAT GGCTCGGGCT TGGTCACCAT CCGCGACCTT GTGAAGAACT CGCTGCGGAT GCGCCCTGAC CGCATTGTGG TAGGGGAAAT TCGTGACGGC GCGGCGCTCG ACTTGCTCCA AGCCATGAAC ACTGGCCACG ACGGCTCAAT GTCCACCGTC CACGCCAACA CCCCACGCGA CGCAATTCGC CGTTTGGAAA CCTTGGCACT GATGTCGGGA CTCGATTTAC CAGTTGCGGT TATTCGCGAA CAAATTGCTT CGGCAGTACA CGTGATTGTG CAGCAAGCCC GTTTGCGCGA CGGCTCACGG AAAGTCGTAG CGGTAACTGA AGTTCAAGGC ATGGAAAGTG GTCAGGTCGT GCTACAAGAT ATCTTCATCT TTGAAGATCA AGGTACTGCG CCAGATGGCA AAGTGATGGG TGTGCTGCGG CCAACCGGGA CGCGTCCACG CTTTACGCCA GTGCTCGAAG CCAAAGGCTT CAAATTGCCT CCATCAATCT TCGGCGCAAC CTTCCCCGGC ATGCGGCGCT AA
|
Protein sequence | MSLLKRLGST PTSPEPTPPA QPVQPMAGEQ SMPEYRSSSL TPLSPPPPAP SSPPMPGGNL SALAGSSSGL SGGTINTSGF LVLADRDAHR KMLELSLWIV DKIQASVGSQ TQLQRNEDSE RLIQERFTTF VRQSTTNLDQ DGTKLLYQMV LDELFGFGPL EQLIRDDTIT EIMVNSATVV YVEQKGKLTL SPVIFASEDH VLKVIDRIIR PLGRRVDRKW PMVDARLPDG SRVNAIIPPC AIDGSSLSIR KFSKKKLQVS DLINYGSMTK EMADFLNACV VSAMNLIVSG GTGSGKTTLL NVLSNFIPDH YRICTIEDSA ELQLGKDHVI RLESKPADVD GSGLVTIRDL VKNSLRMRPD RIVVGEIRDG AALDLLQAMN TGHDGSMSTV HANTPRDAIR RLETLALMSG LDLPVAVIRE QIASAVHVIV QQARLRDGSR KVVAVTEVQG MESGQVVLQD IFIFEDQGTA PDGKVMGVLR PTGTRPRFTP VLEAKGFKLP PSIFGATFPG MRR
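The relationship between the two sequence fields can be sketched in a few lines of Python. This is an illustrative sketch, not the pipeline that produced the record: the helper names `clean`, `translate`, and `gc_fraction` are hypothetical, and the codon table uses the standard amino-acid assignments, which translation table 11 shares for elongation codons (alternative start codons are not handled). It translates only the first seven codons and does not recompute the record's full protein sequence or GC content.

```python
# Standard codon assignments in TCAG order; translation table 11 uses the
# same amino-acid assignments (it differs only in permitted start codons).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the display spacing (blocks of 10) and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate a + strand CDS from its first base, stopping at a stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[pos:pos + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq):
    """Fraction of G and C bases in the sequence (0.0 for an empty string)."""
    seq = clean(seq)
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 21 bases of the Gene sequence field, spacing kept as displayed.
first_codons = "ATGTCACTTC TCAAACGTCT T"
peptide = translate(first_codons)
print(peptide)  # MSLLKRL, the first seven residues of the Protein sequence field
```

Passing the full Gene sequence field to `translate` and `gc_fraction` would allow a comparison against the listed Protein sequence and GC content values.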