Gene Information |
Locus tag | Tgr7_1249 |
Symbol | |
ID | 7317738 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. HL-EbGR7 |
Kingdom | Bacteria |
Replicon accession | NC_011901 |
Strand | + |
Start bp | 1340821 |
End bp | 1343565 |
Gene Length | 2745 bp |
Protein Length | 914 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 643616137 |
Product | Tfp pilus assembly protein FimV-like protein |
Protein accession | YP_002513322 |
Protein GI | 220934423 |
COG category | [N] Cell motility; [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3170] Tfp pilus assembly protein FimV |
TIGRFAM ID | [TIGR03504] FimV C-terminal domain; [TIGR03505] FimV N-terminal domain |
|
|
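The coordinate and length fields in the table above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the listed gene length is consistent with that reading); the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 1340821
end_bp = 1343565
gene_length_bp = 2745
protein_length_aa = 914

# Assumption: both coordinates are inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# A CDS is made of whole codons; the last codon is the stop codon
# and does not contribute an amino acid to the protein.
codons, remainder = divmod(span, 3)
expected_protein_length = codons - 1

print("span (bp):", span)
print("leftover bases after dividing into codons:", remainder)
print("expected protein length (aa):", expected_protein_length)
```

If the record is internally consistent, the span matches the Gene Length field, no bases are left over, and the expected protein length matches the Protein Length field.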
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.358578 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGCATGG CCCTCTGGGC GCTGTTACTG GTACCGGGAC TGTCACAGGC GCTGGGTCTT GGGGAGATCG AAGTCAATTC GGCCCTCAAT CAGCCGCTCA ACGCGGAGAT CGAACTGGTC TCGGTGCGCC CCGGCGAGAT GCAGGATCTG CAGGTCCGCG TTGCCCCGGA GGCGCTCTAC CGCCGCCTGG GCATCGAACG TTCCGCCATC ATCACCGAAC TGCGTTTCGC CCCGACGACC CTGCCGGACG GCCGTCACGT GATTCGCGTC ACCACCAGCA ACCCGGTGCG CGAACCCTTC GTGAACTTCC TGGTGGAGGC CACCTGGCCC GCCGGCCGCC TGGTGCGGGA ATACACCCTG CTGCTGGACC CGCCGGTGCT GTTCGAGCAG CGCGCCGAGC CCGCCCCCCG GGCACCGGTC ACCGAGGCCC CGCGCGAAGC CCGGCCGACC CCTGCGCCTG CCGCTCCCCG TCCTGCTGCC CCGGCGCCCC GTGCCACCCT CGATACCTAC CGCGTGCAGC GCGGCGACAT GCTCTGGAAC GTGGCCCGGG ATCTGCGTCC CGATGCCAGC GTCAGCGTCG AGCAGATGAT GCTGGCCCTG CTGCGCGCCA ATCCCGAGGC CTTCACCGAC AACAACATCA ACAACCTGCA GAGCGGCCGT GTGCTGCGCG TGCCGGACAT GGCGGAGATC AACCGCCTGA ATCAGGCCGA AGCCCGCACT GAAGTGGCCC GACAGAACGC CCTGTGGCAG GAATACCGCA CCCGGGTCGC CGCGGCACCC CAGCCCCAGC TTCCCGCCCA GCCGGGCACC GAGACCGCCG TTGCGCCCCG TGCCGAGGCC GCGCCGGCAC CGGCCGAGCA GGATGCACGG CTTGAGATAC TGGCCGCCCG CGACACCGAC GAGGCCCGTG CCGAGGTCGC GCGTGAACTG GCGCTGGCCC GGGAAACCGC CGAGACCCGT GGCCGCGAGG CCGAGGAACT GCGTTCCAGG GTGGCTGAAC TGGAATCCCT GCTGTCGCGT AGCGAGCGGC TGCTGGAACT GAATAACGCC CAGATGGCCG AGCTGCAGGC CCGCCTGGAT GAGCTCGAAG GTCGCGAACC CGCAGAGCCC GCTGAAGTGG CCGAGGCGCC CGCAGTGGTT GCGCCCGAGG AGCCGCCGGT CGTGGCCGAG ACTGAACCGG CCCCCGCGCC TGTGGAGCCT GCGGTAGAAG CCGAACCCGT TGTGCCTGGC GAAGCCGTGA CCCTCAGCGA GGTCGAGCGC ATGGCCGAGA CCGCCGAGGC GCCGGCCGTC GAGGCTGCCC CGGAACCGGA GGTGGCCGCG GCCCCCGCGC CGGCCCCGGC CCCCGCAGCG CCCGTCCAGC CCCGTCCTGC GAGCCTGGTC GACGAGATCA TGGCCAGCCC CAACCTGATG ATGATCGCGG GTCTGGTGCT GCTGCTGGTG CTGCTGCTCA TCTGGCTGAT GATCAAGCGT CTCGGCAAGG GCAAACGTGC GGCCGCCGCC TCCGGCATGG CCTCACCGGC CGGCGGCCGG ACCGAACCCG CCCTGTCCGG TGGCGAGGGC GCCCTCGCGG CGGCTGCCGC CGGCGGTGCC CTGGCCGGTG CTGCCGCCAT CGCGGCCGAT CGCGGTGAGG CGGAAGGTCA GGAAGCCGAA CTGAGCGAAG CCGAAGAAAT GGCAGCCGCC GGGACCGAAG AGACCGAGGC CCCTGCCGCT GGTGTCGAGA GCGAGGAGAT CATCAACGAC GACACCATCG CCGAGGTGGA TGTCTACCTG GCCTACGGCC TGTATTCCCA GGCCGAGGAT 
CTGCTCAATA AGGCCGTGCA GGAACACCCC GATCGCGTGG ATTACCGCTT CAAGCTGGCG GAGACTCATT ACGCCAACCG TAACGTGGAG GCCTTCGAGG CCAACGCCCA GGCCATGCAC GATACCCTGG CAGGCCGTCC GAGCCAGCTG TGGGAACGCG TGCAGTCCAT GGGGCAGGAT CTGAACCCGG CCAATGCACT GTTCTCCGGT GCAGCCGCCA TGGATTTCGG CGGTGATGCC GGCGCCCCCG AGGTGGATCT GGACCTGGGC GTGCAGGACC TGGAGGGTTC CCTGGAAGAC CTGGACCTGG GCGAGGAACC GGCTGCCGAG CCGGCTGAGA CGCCGGAGAT GGAAGCCGCC GGCGAGGTGG AGCTGGATCT GGACGTGGAC GAGAAGCCCA AGGCCTCAGC CGATGAGGCC CTGGAATTCG ATCTGGGCGA CCTGGACCTG GGAGGAGAGG GCCTGGATAC GGAACTGCTC GACACCACGG CGGAAACCCA GGTGGAAGAG CCCGTGGCGT CCAAGCCGCT GGACGTGTCC GCGGAGCTTG AGGAGAGCCT GGAGATCGGT TCCCTGGACG AGGATCTCAG TGCCGGCCTC GATGAGGAAC TCGCGGCCTC CTTCGACCTG GAAGGCGAGA CCGCGGCGGC GGAACGCCCC GTGGGCATCG ATACCGCCGA AGACGACACC CAGATCATGG AGGAGGATTT CTCCGAGATC GATTTCCTCG GCGAGGGTGC GGAAGGCGAT GATCTCCTGG TCACCGACGA GGAGGAGCAG GCGCTGCCCT CCGGAGACGA GGTCAGCACC AAGCTGGATC TGGCCCGTGC CTACATCGAC ATGGGTGATT CCGAAGGGGC GCGCAGCACT CTCGAGGAAG TCCTGACCGA AGGCAACGCC AGCCAGAAAC AGGAGGCCCA GGGGCTGCTC GACCAGCTGT CCTGA
|
Protein sequence | MGMALWALLL VPGLSQALGL GEIEVNSALN QPLNAEIELV SVRPGEMQDL QVRVAPEALY RRLGIERSAI ITELRFAPTT LPDGRHVIRV TTSNPVREPF VNFLVEATWP AGRLVREYTL LLDPPVLFEQ RAEPAPRAPV TEAPREARPT PAPAAPRPAA PAPRATLDTY RVQRGDMLWN VARDLRPDAS VSVEQMMLAL LRANPEAFTD NNINNLQSGR VLRVPDMAEI NRLNQAEART EVARQNALWQ EYRTRVAAAP QPQLPAQPGT ETAVAPRAEA APAPAEQDAR LEILAARDTD EARAEVAREL ALARETAETR GREAEELRSR VAELESLLSR SERLLELNNA QMAELQARLD ELEGREPAEP AEVAEAPAVV APEEPPVVAE TEPAPAPVEP AVEAEPVVPG EAVTLSEVER MAETAEAPAV EAAPEPEVAA APAPAPAPAA PVQPRPASLV DEIMASPNLM MIAGLVLLLV LLLIWLMIKR LGKGKRAAAA SGMASPAGGR TEPALSGGEG ALAAAAAGGA LAGAAAIAAD RGEAEGQEAE LSEAEEMAAA GTEETEAPAA GVESEEIIND DTIAEVDVYL AYGLYSQAED LLNKAVQEHP DRVDYRFKLA ETHYANRNVE AFEANAQAMH DTLAGRPSQL WERVQSMGQD LNPANALFSG AAAMDFGGDA GAPEVDLDLG VQDLEGSLED LDLGEEPAAE PAETPEMEAA GEVELDLDVD EKPKASADEA LEFDLGDLDL GGEGLDTELL DTTAETQVEE PVASKPLDVS AELEESLEIG SLDEDLSAGL DEELAASFDL EGETAAAERP VGIDTAEDDT QIMEEDFSEI DFLGEGAEGD DLLVTDEEEQ ALPSGDEVST KLDLARAYID MGDSEGARST LEEVLTEGNA SQKQEAQGLL DQLS
|
| |
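The sequences above are displayed in space-separated blocks of 10 characters, so they need to be cleaned before any analysis. The sketch below shows one way to do that and to compute a GC fraction; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first two and last two blocks of the gene sequence are used here for illustration, not the full 2745 bp.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# Illustrative fragments copied from the start and end of the gene sequence.
head = clean_sequence("ATGGGCATGG CCCTCTGGGC")
tail = clean_sequence("GACCAGCTGT CCTGA")

print("first codon:", head[:3])   # expected to be a start codon for a CDS
print("last codon:", tail[-3:])   # expected to be a stop codon for a CDS
print("GC fraction of the first 20 bases:", gc_fraction(head))
```

Applying `gc_fraction` to the cleaned full gene sequence would give a value comparable to the GC content field in the record. The final codons shown (GAC CAG CTG TCC TGA) correspond to the last residues of the protein sequence (D, Q, L, S) followed by a stop.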