Gene Information |
Locus tag | Tgr7_3095 |
Symbol | |
ID | 7316025 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. HL-EbGR7 |
Kingdom | Bacteria |
Replicon accession | NC_011901 |
Strand | + |
Start bp | 3245313 |
End bp | 3246338 |
Gene Length | 1026 bp |
Protein Length | 341 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 643617994 |
Product | glycosyl transferase family 2 |
Protein accession | YP_002515151 |
Protein GI | 220936252 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0463] Glycosyltransferases involved in cell wall biogenesis |
TIGRFAM ID | |
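The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are inclusive 1-based coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3245313
end_bp = 3246338

# Assumption: Start/End are inclusive 1-based coordinates.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields, and the remainder of zero indicates the CDS is a whole number of codons.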
| |
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATCGACA CCTCCCTCCC GGACACCACC GAGCCCCTCA CCCTGTCGCT GGTGGTGCCC ATGTACAACG AGGAAGACAA CGTCGCGCCC TTCGTGGAAC GGGTCCACGA GGTACTGTCC GGGTACGCCT ATCCCTGGGA GCTGGTGCTG GTGGACGACG GCAGCGCCGA CAACACCAAC GAGCGCATGC GCGCCCAGCG CAAGCAGTTC GGACACCACG TGCGCATCCT CACCCTGCAA CGCAACTTCG GCCAGACCGC CGCCATGCAG GCGGGGATCG ACATCGCCCG GGGCAGCGTG ATCGCCCTCA TGGACGGCGA CCTTCAGAAC GACCCGGCCG ACATCCCCGG CATGGTGCAG CGCCTGGTGG ACGAGGACCT GGACATGGTG GCCGGCTGGC GCAAGAACCG CCAGGACGAC CTGTGGCGGC GCAAGATCCC CTCGCGCATC GCCAACAAGC TGATCCGCTC CATCACCGGC GTGAACCTGC ACGACTACGG CTGCAGCCTG AAGGTATTCC GCGCCCACAT CCTCAAGGGC GTGCGCCTGT ACGGCGAGAT GCACCGCTTC ATCCCCGCCT GGTTCGCCAC CCAGACCTCG CCGAAACGCA TCCGCGAACA CGTGGTGCAG CACCATGCCC GCACCCGGGG CGCCTCCAAG TACGGCATTT CCCGTACCTT CCGGGTGATC CTGGACCTGC TGGCGGTGTA CTTCTTCATG CGCTTCAAGG CCCGCCCCGG CCACTTCTTC GGCTCCATCG GCCTGGTGTT CGGCGGCCTG GGCGCCGTGA TCCTGGGCTA CCTGCTCATG CTCAAGGTGT TCTGGGACGC GGACATCGGC ACCCGCCCGC TGCTGTTCGT CGGCGTGATG TGCGTACTGG TCTCGGTACA GTTACTCACC ACCGGCGTAC TGAGCGAACT GACCGCCCGC ACCTATTTCG AATCGGGGCA GCAACGCTCC TACGTGGTGC GCTCCAGCGT GCACGAACAG GCCGGCGAGA CCGACTGGCA TCACGGACGT GGCTGA
|
Protein sequence | MIDTSLPDTT EPLTLSLVVP MYNEEDNVAP FVERVHEVLS GYAYPWELVL VDDGSADNTN ERMRAQRKQF GHHVRILTLQ RNFGQTAAMQ AGIDIARGSV IALMDGDLQN DPADIPGMVQ RLVDEDLDMV AGWRKNRQDD LWRRKIPSRI ANKLIRSITG VNLHDYGCSL KVFRAHILKG VRLYGEMHRF IPAWFATQTS PKRIREHVVQ HHARTRGASK YGISRTFRVI LDLLAVYFFM RFKARPGHFF GSIGLVFGGL GAVILGYLLM LKVFWDADIG TRPLLFVGVM CVLVSVQLLT TGVLSELTAR TYFESGQQRS YVVRSSVHEQ AGETDWHHGR G
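The GC content field can be recomputed from the gene sequence. The sketch below is one way to do that for sequences written in space-separated blocks of ten, as in this record. The function name gc_fraction is hypothetical, and the sample is only the first 30 bases of the gene, so its value will differ from the 66% reported for the full sequence.

```python
def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace and case."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for b in bases if b in "GC")
    return gc / len(bases)

# First three 10-base blocks of the gene sequence in this record.
sample = "ATGATCGACA CCTCCCTCCC GGACACCACC"
print(f"{gc_fraction(sample):.1%}")
```

Passing the full Gene sequence string in place of sample gives the figure to compare against the GC content field.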