Gene Information |
Locus tag | Haur_5240 |
Symbol | |
ID | 5737198 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009974 |
Strand | - |
Start bp | 6308 |
End bp | 8647 |
Gene Length | 2340 bp |
Protein Length | 779 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 641282404 |
Product | Type IV secretory pathway VirB4 protein-like |
Protein accession | YP_001547995 |
Protein GI | 159901750 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3451] Type IV secretory pathway, VirB4 components |
TIGRFAM ID | |
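
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the CDS ends in a single untranslated stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 6308
end_bp = 8647
protein_length_aa = 779

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
expected_protein_length = gene_length_bp // 3 - 1

print(gene_length_bp)            # 2340
print(expected_protein_length)   # 779
```

Under these assumptions the computed values agree with the listed Gene Length (2340 bp) and Protein Length (779 aa).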
| |
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATCAGC AGCCGCTGTG TTTGACGATT GACCCCTTTT CGGTTCGCCA GTACCGTGAG GAATTGCCCC AGCTGGAGCA GCGCTTTGCC AACTTTTGGG CAGGCATCAC CTACGATGCA CGGCTGATCT CGTGTACGCG GCGCTTTTCG TTTGCGCCGA TTCGCCAGCG CTTGCGCCAA CAAACCAGCC CGCTTGACGA CCTGCGCGAT CTCATGCCCA TGCTGGTGTC AGCGCAGGAG GATGGAAGCC GAAACGCGGC ATTAACCAGC TTGGTCCAGA AGCGGCTCGC AACCTATGAA CGGGCGACTG CGGCATTACA GGATGCGCCA GCGGTGCATG CTGCTTTCCG CGCGATGGCC ACGGGTGCGA GCGATGCCAC GACGCTGGCC GTGGTCGCCG ATGGCTGCCG CCGTGCCCTT TGGCCGTGGC GGTGGCTCAA GAATTATCGG CGGGCGTATG AGGTCATGGA GCGCGAAGGC AATCCCCTTG GCATTCAGCA TTACTTTGTG GCCTGGCCGT CCGAGTATAC GGATGCCGAG GCGGTGCGCA GTGTGCTTAA GGGCACGTTC TTATTGCCCG ATGTCCAGAG CGCACCGCTA CCACCCTTGT TTCATGGGAA ATACCGCGAA ATGGCGACCT ACTTGACCCC GCTCGACGAG GGTCGTCCCT ACTTAAGGGT GATTCACGCC TTTGATGTGC GGGGTGAGTG GGATTTGGGC AGTATGCAGG AGCTTTTAGG CGGGGAAGAA GAACTGGCCG TCGCGCTGGA TGTCACCACC TTGCCCCGCG CCAAGGCCCA ACGCGCGACG ACCGATGCCT TTAATGTGCT TGAGGGTGCG TTGACGGCGC GGAATGCCGT GAAGGACTCG CGCAGCGAAC GCGCCTACCG CGATGTCAGC TACGCCATGA ACCAACTCGA TGTCCAGCAG TTGCACGAGG TGGCCTATGC GATCCTGATT CAAGCACCGA GCGTGCGGGA CTTGAATCGC ATCACCCAAA CCCTGCGTGA CCGCATGGGC GCACGGCTCA AACTGGATGT GCTGAGCGGA ACCCAAGGCG AATACCTCAA ATTGTTCACG ACCACGCCGA GCAAGCAGAT TGCCGCGCCG CTCATCCGGC GGAATGCCTT GAGTGAACAC GTCGCGGCCA AAACGCCCTG GGGGATTCGC AAGAGCGATG CGACCAGCGG GGTGCTGTTT GGCTACGATC CCCACGACCA ATTGCCGTAC CACTATAACC TCTTTGGCGC GACCGGAACC GACAACCCGC ACTTGCTGAT GCTCGGCAAA TCGGGCAGCG GCAAAACCGT GAGTTTAGGC ATGCTAGCGT TGCGCCATGC CGTCGCAGGC CACCAGATCG TGATGTTCGA TCCGGTCGGG AATTGTGCGC GGCTGTGTGA GGCGGTCGGC GGTGGGGCGG CCTACTATCA CTTGGCCGAG GACGTGGCGA TTAATGTGCT GGATCCAATG GAAACCAGCT TGCATCGCCA GAAAAGCCAT GTGGAGCGCA AGTTGTCCAT GGTCTTAGGC CGCGCGATCA CCAGTGGCTC TGGTGTCCAG TTGCGCCCGC GTGAGTTCAG TAACGCCGAA CGGGGAGCGC TCGATGCCGC CCTCGCATCG ACGCGGATCT ATGGCCCCGA TGGGGTGTTT TTGGCGCAGA TGGATGACGA CACCGCGCCG CTCTTGAGCG ATCTGGTGCT GGCCTTGCGC GAAACCAAGC GCCCCGTGGG GCAGGCATTG GCTGAGGAGA TCACCGATAT TGCCTTGCAA TCGCAAGCCC ACTTGTTTGA TCGCCAAACG 
ACCTTGAAGT GGGATTTTGG CAGCGATGTG GTGGCGTACA ACCTCAACAA CGCCGATAAA GCCTTGCTCC CCTTGTACCT TGATCATGGC ATCGGGGCAC TCAACCACTA CATTCGCAGC CCTGAACGAC GAGCGCGAGG CCAAAAGCTG GTCTGTGTGG TGGACGAGTT CGGCATTCTT TCACAAATTG AAAGCCTGAA AAAAGAGGTC GCTAATGCGA CGAAAGAATG GCGGAATTAC GGTGCAGCGC TTTGGTCGTG TGATCAGAAT TCGGCGACGT ACATGGGTGG TAGCGGCAAT GCCCAAGACT TCAATAACCT GACGACCAAC AACACCGCCG TGAAGCTGTT TGGGCGGCAA GAGGGAACCG ACGCGAACCT GCTTGGTGAG GCGTTTCCGG AGTTATCCCC CAGCGATATT GCAGCCATCA GAACGGCAGG CCCAGGCGAG TTTGTGGGCA TTTTCGGCAC AAACGAAGTC CACCATCTGC GCATGCAATT GACCGATCAA GAAGTGGCCC ACTTTATTCG GAAGGGTTAA
|
Protein sequence | MNQQPLCLTI DPFSVRQYRE ELPQLEQRFA NFWAGITYDA RLISCTRRFS FAPIRQRLRQ QTSPLDDLRD LMPMLVSAQE DGSRNAALTS LVQKRLATYE RATAALQDAP AVHAAFRAMA TGASDATTLA VVADGCRRAL WPWRWLKNYR RAYEVMEREG NPLGIQHYFV AWPSEYTDAE AVRSVLKGTF LLPDVQSAPL PPLFHGKYRE MATYLTPLDE GRPYLRVIHA FDVRGEWDLG SMQELLGGEE ELAVALDVTT LPRAKAQRAT TDAFNVLEGA LTARNAVKDS RSERAYRDVS YAMNQLDVQQ LHEVAYAILI QAPSVRDLNR ITQTLRDRMG ARLKLDVLSG TQGEYLKLFT TTPSKQIAAP LIRRNALSEH VAAKTPWGIR KSDATSGVLF GYDPHDQLPY HYNLFGATGT DNPHLLMLGK SGSGKTVSLG MLALRHAVAG HQIVMFDPVG NCARLCEAVG GGAAYYHLAE DVAINVLDPM ETSLHRQKSH VERKLSMVLG RAITSGSGVQ LRPREFSNAE RGALDAALAS TRIYGPDGVF LAQMDDDTAP LLSDLVLALR ETKRPVGQAL AEEITDIALQ SQAHLFDRQT TLKWDFGSDV VAYNLNNADK ALLPLYLDHG IGALNHYIRS PERRARGQKL VCVVDEFGIL SQIESLKKEV ANATKEWRNY GAALWSCDQN SATYMGGSGN AQDFNNLTTN NTAVKLFGRQ EGTDANLLGE AFPELSPSDI AAIRTAGPGE FVGIFGTNEV HHLRMQLTDQ EVAHFIRKG