Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tgr7_1691 |
Symbol | |
ID | 7315713 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. HL-EbGR7 |
Kingdom | Bacteria |
Replicon accession | NC_011901 |
Strand | - |
Start bp | 1790231 |
End bp | 1791343 |
Gene Length | 1113 bp |
Protein Length | 370 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 643616583 |
Product | CapA family protein |
Protein accession | YP_002513761 |
Protein GI | 220934862 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG2843] Putative enzyme of poly-gamma-glutamate biosynthesis (capsule formation) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAACGACG TGATCACCCT GTTTCTGTGC GGCGATGTCA TGACAGGCCG GGGCATCGAC CAGGCCCTGC CCCACCCGGC GCCGCCCGAC CTGTACGAGC TCTGGGTGCA GGATGCCAGG GAGTACGTGA GTCTTGCCGA GGCCGCCAAC GGCGAATTCC CGAAGCCGAT GGCGCCCGCC CACATCTGGG GCGATGCCCT GGCCGAACTG GCGCATTTCA GGCCGCACTT GCGCCTGATC AACCTGGAAA CCGGCATCAC GCGCCACGGC ACGCCCTGGC CGGACAAGGG CATCCATTAC CGCATGAGTC CCGAGAACGC CGCCTGCCTG CAGGCGGCAA GGATCGACTG TTGCTCGCTG GCCAACAACC ACGTCATGGA CTGGGGCGTG GATGCCTTGC GGGAGACCTG CCGGGTGCTG GACGGACTGG GCATCGCCCA CGCCGGTGCC GGCGAGGATC TGGGGGCAGC GATGAAACCG GCACGCCTCG ACGTGACCAC GGGGCACCGG GTGTGGGTCT TCAGCCTCGG CATGACCGAC AGCGGCATCC CGCCCGAGTG GCATGCGGCC AGGGGCCGGC CAGGCGTGTG GCTGGCCACG GAGGCCTCGG CAGCCGGTGC GGAACCGGTA GCCGAGCGGA TCCGGGCGCA CAAGCGCCCC GGTGACCTGG CAGTGGTGTC GGTGCACTGG GGCAGCAACT GGGGTTATCG CATCCCGAAG GGACACCGTG AATTTGCCTA CCGGCTGGTC GAGGCCGGCG CGGACCTGAT CCACGGGCAT TCCTCCCACC ACCCGGTCGG CATGGAACTC TACCGGGGAC GTCCTATCCT CTATGGCTGT GGAGACTTCA TCAACGACTA CGAGGGCATC CGGGGTTACG AGATTGTTCA GCCGGATCTG ACCCTGATGT TCTTCCCGGC CTTCGATGCG GCGACCGGCG TGTTGAGATC CATGTCCATG ACCCCCCTGC GCCGGGCGCG TTTCTCCCTG CACCGCACCG GCAAGGACGA TGCCCGCTGG CTGTGCGAAA TGCTCAACCG GGAGCGCCAG GGAGATGATC CGGAATTCCG GCTGGATGAT GCAGGGCGGA TACAGTGGGA TCTTGCGCAG TGA
|
Protein sequence | MNDVITLFLC GDVMTGRGID QALPHPAPPD LYELWVQDAR EYVSLAEAAN GEFPKPMAPA HIWGDALAEL AHFRPHLRLI NLETGITRHG TPWPDKGIHY RMSPENAACL QAARIDCCSL ANNHVMDWGV DALRETCRVL DGLGIAHAGA GEDLGAAMKP ARLDVTTGHR VWVFSLGMTD SGIPPEWHAA RGRPGVWLAT EASAAGAEPV AERIRAHKRP GDLAVVSVHW GSNWGYRIPK GHREFAYRLV EAGADLIHGH SSHHPVGMEL YRGRPILYGC GDFINDYEGI RGYEIVQPDL TLMFFPAFDA ATGVLRSMSM TPLRRARFSL HRTGKDDARW LCEMLNRERQ GDDPEFRLDD AGRIQWDLAQ
|
| |