Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_37360 |
Symbol | tmp |
ID | 7762629 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | - |
Start bp | 3786701 |
End bp | 3789937 |
Gene Length | 3237 bp |
Protein Length | 1078 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 643806603 |
Product | Phage TMP domain-containing protein |
Protein accession | YP_002800856 |
Protein GI | 226945783 |
COG category | [S] Function unknown |
COG ID | [COG5283] Phage-related tail protein |
TIGRFAM ID | [TIGR01760] phage tail tape measure protein, TP901 family, core region |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGCGTG ATCTGAGTTT GCAAGTCCGC CTGAGCGCGA TAGACCGGAT CACCGCGCCG TTGCGCCGCA TCGTCCAGGG TAGCGGCGCC CTGGCCCAGG CGATGAAGGC CAGCCAGGAC CAGCTCAAGG CCTTGAATCA GCAGCAGCGC GACTTGAGCG GCTATCGCCA GACGAATGTC GAGATCGCCC GGCAGACCAA GGCTATCCAG GCACTGCAGG CCAGGACCCG CGAGCACACC CAACTGCTGG AAAAGCAGCG GGCCGTGCAT GTCAACCTCA AGGGCAACCT GAAGGCGGCC CAGACGCAGT ACAACAAGCT GGCCAAGGCA CTGATCGAGG GCAAGGGCGA GACGGCCAGC TTCCACTTCG AGCTGGAGAA GGCCCAGATC AAGCTGCAAT CCGCCCAGCA GGCCTTCAAC CGCTCGTCCA GCACGATCAA GACCTACAAG GACCGTATTC GCCAGGCTGA CAGTCAGCTT GCCCAACTCG GCAGCCAGCA GCAGAACAGC CAGGAACGTC TGGCCGGCTA CAAGCGGCGG CTCGACGAGG CCGGCATCGG GACGGAGCGC CTGGGCAGTC GGGCCCGGCA ACTGCGCGGC GAACAGGAAC GCCTCAACGC CGTGCTCGAG GCGCAGAAAG CCCGGCTCGC TGCCGTCACC GCGCAGCAGG AGCGGCTGAC CAAGGCACAG AAGAGCTACG AACGCGCCCA GGCCGTGGCC GGCAAGATCG CCATCGGCGG GGCTGCCAGC TTGGCGAACG GCTACGCGCT GTCCCGCCCG CTGTCTGCCG TGATGGACGC CTACGCGCCA GCCGAGGACG CCGCAGCCCA GTTGCGCGCT TCGATGATGG GTGCCGATGG CAGCGTCTCG GCGGACTTCG AAAAGATCAG CGCCTTGGCC ACCCGCTTGG GCGACCGCCT GCCCGGGACC ACCGCGCAGT TCCAGGAAAT GATGACCATG TTGCGCCGGC AGGGCATCAG CGCACAGTCG ATCCTCGGCG GTACCGGCGA GGCGGCGGCC TACCTGGCGG TCCAGCTCAA GATGGGCAGC AGCGAGGCGG CCGAGTTCGC GGCCAAGATG CAGGACTCCA CGCGCACGAC TGAGGCTGAC ATGATGGGGC TGATGGACAC GATCCAGCGC ACCTTCTATC TCGGCGTCGA TCCGACCAAC ATGCTCCAAG GGTTCGCCTC GATCTCGCCA GCGCTGTCGA TGATCCGCAA AAGTGGGCTG GAGGCAGCCA ACACCCTGGC CCCGCTACTG GTCATGATGG ATCAGGCCGG TATGTCCGGA GAGTCGGCCG GCAACGCCCT GCGCAACGTG TTCCAGTCTG GATTCAAGAC TGACAAGGTT GCCAAGGCCA ACAAGATGCT GAAGAAGCTC GGCATCAGCC TCGACTTCAC GGACGGAAAA GGCGAGTTCG GCGGACTGGA GAAGCTGTTC GCCCAGTTGC AGAAGCTCCA GAAGCTGACC ACGGAAAAGC GCACCGCCGT CATCAGCGAG ATATTCGGCG ACGACTCGCA GAACCTGCAG GTGCTCAATA CCTTGATTGA CAAGGGCCTG GATGGCTACC GCGAAGTCGA GGCCAAGATG AAGGCCCAGG CCGACCTGCG CAAGCGCGTC GACGACCAGC TCAAGACCCT GACCAATGTC ATCGACGCAG CCCAAGGCAG TTGGACCAAC GCCATGGCTG AGTTCGGCGC AGCCGTGGCT CCGGAACTGA AGGGCTTGAT CCAGTGGCTC GGCAACGTCG CCAGTGGCAT TGGCGCCTGG GCGCGGGAAA ATCCGCAACT GGCCGGGACG CTGGTCAAGG TGACAGCAGG CATCGGGGCT CTGGCGGCGG CTGGTGGTGC GCTGGCCATC GGCATGGCCG GGTTGATCGG GCCGTTCGCC ATGGCCAAGC TCGGCCTCAG CGTCTTCGGC ATCCAGGCCG GTAGCGCCAT GGCGAGCACG GGGCTGTTGG GCAAGGCCCT GGGAGGACTT TCGACCAGCC TGTCGGGGCT GGGCGCGGCC TGGCAAGCCG CCTCGCTCGG GACCGTCCTC ACCGCGCTAC CGGGACGCCT CAAGGCCGCC GCCAGCGCGG CCAAGGCCTG GGTGGCCAGC GCCGGTAGCG CCCTGGTCGG CAGTTTCCGG GCGGCCGGCA GCTCCGCCCT GGCATTCGCC ACCGCGCCCT TGCGGTGGGT GATCAAGGGA CTGCGCGAGG CCGCGCTCGC GGCGTGGATG AATATCCGCG TCAACGGCTT GCTCGGCGCC AGTTGGAACG GCATCAAGGC GGGGGCCGGT GGCCTGCTGG CGGTGCTGCG CGGCGGCTTC TCCGCCGTGC TCGGCGGTGC CGCCGGCGCC TTGCGTCTGT TCGGGCAGGC CATCGTGTTC GCGGGACGTG CCCTGCTGCT CAACCCCATC GGCTTGACTA TCTCGGCGCT GGCCCTGGCC GGCATGGCCT TGCTCAAGTA TTGGCAGCCG GTCAAGGCCT TTTTCGGCGG CTTCTGGCAA GGCTTCACCC AGGGGCTCGA ACCGCTGGCC CCGGCCTTCG CGGCGCTGGG GAGCGCCTTG GCGCCGCTCA AGCCGCTCTG GGACGGTATC GCCGGAGCCA TGTCGGCCGC GTGGCAGTGG GTCAGCCGGC TGTTCGCCCC GTTCCAGGCG ACGGCCACCG AACTCCAGAA TGCCACCAGC CGGGGCCAGG CCTTCGGCCT CTGGCTGGCC GGGCTGGTCA ACTCCCTGAC AGCCATCGCC GGCAAGATGT TCGGCTTCGG GGTCGACATT GTGAAAGGGC TGATCAACGG CATTCTCAGC ATGAAGAATA CCGTCCTCGG TGTTATCGGT GGTATTGGCA GCAGCATCAC CGGCTTGTTC TCGAAGGATC AGGAGATCCA CAGCCCGAGC CGCGTATGGG CCCAGTTGGG CAACTACACG ATGCAGGGCC TCGAACAGGG CCTGCTGAAG GGGCAGGGCG GGCCGCTCGG CGCCATCGCC GATCTGAGTC GGCAGCTCAC CCAGGCCGGC GCGCTCACCG TGGGCCTGGG CGCGGCCGGC GGTGCGCTGG CGATCGACAA CCGCCCGCCG CTGGCCGCTG GCGGTGCGGC GCCCATCGTC GTCCAGGGCG ACACCATCAC GATCAACCTC CAGGTTGGTG CCGGCGGCAA CCCGGCCGAC CTGGCGCAAC AGATCAACCG CATCCTCGAC GAGCGCGAGC GGGCCAAGGC CGCCCGCGTG CGCTCCCGCC TGCACGATCA GGAGTGA
|
Protein sequence | MARDLSLQVR LSAIDRITAP LRRIVQGSGA LAQAMKASQD QLKALNQQQR DLSGYRQTNV EIARQTKAIQ ALQARTREHT QLLEKQRAVH VNLKGNLKAA QTQYNKLAKA LIEGKGETAS FHFELEKAQI KLQSAQQAFN RSSSTIKTYK DRIRQADSQL AQLGSQQQNS QERLAGYKRR LDEAGIGTER LGSRARQLRG EQERLNAVLE AQKARLAAVT AQQERLTKAQ KSYERAQAVA GKIAIGGAAS LANGYALSRP LSAVMDAYAP AEDAAAQLRA SMMGADGSVS ADFEKISALA TRLGDRLPGT TAQFQEMMTM LRRQGISAQS ILGGTGEAAA YLAVQLKMGS SEAAEFAAKM QDSTRTTEAD MMGLMDTIQR TFYLGVDPTN MLQGFASISP ALSMIRKSGL EAANTLAPLL VMMDQAGMSG ESAGNALRNV FQSGFKTDKV AKANKMLKKL GISLDFTDGK GEFGGLEKLF AQLQKLQKLT TEKRTAVISE IFGDDSQNLQ VLNTLIDKGL DGYREVEAKM KAQADLRKRV DDQLKTLTNV IDAAQGSWTN AMAEFGAAVA PELKGLIQWL GNVASGIGAW ARENPQLAGT LVKVTAGIGA LAAAGGALAI GMAGLIGPFA MAKLGLSVFG IQAGSAMAST GLLGKALGGL STSLSGLGAA WQAASLGTVL TALPGRLKAA ASAAKAWVAS AGSALVGSFR AAGSSALAFA TAPLRWVIKG LREAALAAWM NIRVNGLLGA SWNGIKAGAG GLLAVLRGGF SAVLGGAAGA LRLFGQAIVF AGRALLLNPI GLTISALALA GMALLKYWQP VKAFFGGFWQ GFTQGLEPLA PAFAALGSAL APLKPLWDGI AGAMSAAWQW VSRLFAPFQA TATELQNATS RGQAFGLWLA GLVNSLTAIA GKMFGFGVDI VKGLINGILS MKNTVLGVIG GIGSSITGLF SKDQEIHSPS RVWAQLGNYT MQGLEQGLLK GQGGPLGAIA DLSRQLTQAG ALTVGLGAAG GALAIDNRPP LAAGGAAPIV VQGDTITINL QVGAGGNPAD LAQQINRILD ERERAKAARV RSRLHDQE
|
| |