Gene Information |
Locus tag | Tgr7_3242 |
Symbol | |
ID | 7315617 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. HL-EbGR7 |
Kingdom | Bacteria |
Replicon accession | NC_011901 |
Strand | + |
Start bp | 3388671 |
End bp | 3389786 |
Gene Length | 1116 bp |
Protein Length | 371 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 643618143 |
Product | domain of unknown function DUF1745 |
Protein accession | YP_002515297 |
Protein GI | 220936398 |
COG category | [S] Function unknown |
COG ID | [COG4398] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
| |
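The coordinate and length fields above are mutually consistent, and a short check makes the arithmetic explicit. The sketch below assumes 1-based inclusive coordinates and a terminal stop codon that counts toward the gene length but not the protein length. The function name `feature_lengths` is hypothetical and is not part of the source database.

```python
def feature_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumptions (not stated in the record itself): coordinates are
    1-based and inclusive, and the final codon is a stop codon.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Drop one codon for the stop, which is not translated.
    protein_len = gene_len // 3 - 1
    return gene_len, protein_len


# Values taken from the Start bp and End bp fields of this record.
gene_len, protein_len = feature_lengths(3388671, 3389786)
print(gene_len, protein_len)  # 1116 371
```

Under those assumptions the result matches the Gene Length (1116 bp) and Protein Length (371 aa) fields.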
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0186777 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCCCTT TCTCCATGGC CGCCGCCGAA GGCATCGAAC CCGTCGCCGT GGCCCGCGAA CTCGCCGCCC GGCTGACCAC CCCCGCCGAG GGCGAGGCCC TGGGCTTCCT GTACGTGAGC GATGCCCTGG CCCGGCGCCT GCCCGAACTG CTCGCGACCC TGCGCGAGGC CACCGGCGTG ACCCACTGGA CCGGCACGGT GGGCATCGCC CTGTGCTGCA CCGGGCGGGA GATCTACGAC CGCCCCGCGG CCGTGGCCAT GCTGGGCAGC TTCCCGGCGG GCAGCGCGCG TGTTCTGCCG CCCATGACCG AGAACGACGA CGCCCTCACC GCACTGCTGG CGGACTGGGA CACACAGGAC CCGGCGCGCT TCGCCCTGCT CCACGGCGAC CCCACCCACG GCACCACACC GGCGATCATC CGCCACCTGG GCGAGCACAC CCAGATCTTC GGCGTGGGCG GCATCGCATC CTCCCAGGGC GACTACCTGT CCGTGGCCGA CGAGGTGGTC CAGGGCGCGG TCTCCGGGGT GCTGTTCGCC GAGTCCGTGC CCGTAGCCAC GGCCCACACC CAGGGCTGCA CGCCCATCGG CCCCGTGCAC CGTATCACCG CCGCCAGCAA CAACGTGCTC ATCCAGCTGG ACGGACGCCC GGCCCTGGAC GTCTTCGAGG AAGACATCGG TGAGGTACTG GCCCGTGACC TGCAGCGGGC GGGCGGCTTC ATCGTCGCCG GCCTGCCGGT GCCCGGTTCC GACACCCGCG ACTACCTGGT GCGCAACCTG GTGGCCGTGG ACACCGGCCA GCGCCTGGTG GCCATCGGCG AGCACGTGCG CGAGGGCGAC GAGATCCTGT TCTGCCGGCG CGACGGCAAC GCCGCCCTGG AAGACCTCAA ACGCATGCTC GCCGACCTCA AGCGCCGCGC CCCCAACGGC GCCCGCGGCG CGGTCTACGT CTCGTGTCTC GGCCGTGGCC GGCACCAGTT CGGCGACGAC TCACAAGAAC TGCGCATCAT CGCCGAGGAA CTGGGCGACA TCCCCCTGGT CGGATTCTTC GCCAACGGCG AGATCTTCCA CAATCGGCTG TATGGCTATA CAGGCGTGCT GACGTTGTTC CTCTAA
Protein sequence | MTPFSMAAAE GIEPVAVARE LAARLTTPAE GEALGFLYVS DALARRLPEL LATLREATGV THWTGTVGIA LCCTGREIYD RPAAVAMLGS FPAGSARVLP PMTENDDALT ALLADWDTQD PARFALLHGD PTHGTTPAII RHLGEHTQIF GVGGIASSQG DYLSVADEVV QGAVSGVLFA ESVPVATAHT QGCTPIGPVH RITAASNNVL IQLDGRPALD VFEEDIGEVL ARDLQRAGGF IVAGLPVPGS DTRDYLVRNL VAVDTGQRLV AIGEHVREGD EILFCRRDGN AALEDLKRML ADLKRRAPNG ARGAVYVSCL GRGRHQFGDD SQELRIIAEE LGDIPLVGFF ANGEIFHNRL YGYTGVLTLF L