Gene Information |
Locus tag | Tgr7_3038 |
Symbol | |
ID | 7315967 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. HL-EbGR7 |
Kingdom | Bacteria |
Replicon accession | NC_011901 |
Strand | + |
Start bp | 3185483 |
End bp | 3187195 |
Gene Length | 1713 bp |
Protein Length | 570 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 643617936 |
Product | hypothetical protein |
Protein accession | YP_002515094 |
Protein GI | 220936195 |
COG category | |
COG ID | |
TIGRFAM ID | |
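
The coordinate and length fields in this record are mutually consistent, and the arithmetic can be sketched as below. This is a minimal illustration, not part of the database record: it assumes the start and end positions are 1-based and inclusive, and that the CDS length includes the stop codon (the gene sequence in this record ends in TAA). The function names `gene_length` and `protein_length` are hypothetical helpers introduced here.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming its length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminal stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


# Values taken from the record: Start bp 3185483, End bp 3187195.
length_bp = gene_length(3185483, 3187195)
print(length_bp)                  # 1713
print(protein_length(length_bp))  # 570
```

Under these assumptions the results match the reported Gene Length (1713 bp) and Protein Length (570 aa).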
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCTCAG GTAGAACACT GTCCGGCAAC TACCCCGGAC ATCCCAAGGC AACGGGCATG AGCTGGCTGA CGCCGGCAAC GCTGCTGGTT GCAAGCCTTG CATTGGCCGG CTGCGGGGGC GGCGGCGGTA GCAGCAACCC CGCCGACCCC GTCATTTCCA CCACCGCCAT CGATGGCGCT GTGAGCAAGG GCCCGGTGGT CGGCGCCCAG GTGCTGCTCT ACCTTGCGGG CGCGGACGGT GGGCATACGG GTGAGCCCGT GGCCGGGCCG TTTACCACGA TCGCGGACGG CACCTGGGAT GATGAGATCC CCGAAGAGCT GCCCCGTCCC CTCGTGGTCA TCGCCACCGG CGGCAGCTAT ACCGACGAAG CCACCGGAGA CACCGTGGAA CTCGGCAGCC GCAGCCTGCG CAGCTATCTG CCGGCTGACG CCGATACGGT GGCCGTCACC CCGCTCACCG AACTGCTGGT GCGCGTGACC CAGGAGCAGA TCGCCGATAG CCCTGACACG ACCATCGAGA ATGCGCTGGA TGCGGCGAAA AACACACTGA ACCAGGCCCT GGCCATCAAC TTCGATCCGC TGACCACCAA GCCGCTGGAC ATCAACAACG CCGGTGACGG TGATCGTGAC CAGCGCGCCT ACACCGCCAT CCTGGCGGGC CTCTCCGTAC TCGCCAACAA CCTGGCACCC ACTGCAGACC CGTTCGAGGT GGTGCAGGCC CTGATTGATG ACATGAGCGA CGGCACCCTG GACGGACAGA AAGCCGGCGA GCCGGTGCCG GTGGGAGACA GCGAAGACAC TCTGCCAACC ACCACGACAA CCGATCTGCT CATCGCCATC AACACTGCCA TTGACGAAGC CGACGATTCC TCTGCCTTCG ATGACGTGGC CGTGAGTGAG GACGGCGAGG GCAATCTGGT CATCTCCCCC CGTTACACCC TGGGTGGAAC CGTCAGCGGT CTCCTTGGCA GCGGCCTGGT GCTGCAAGAA GGCGCTCTCG AGCTGCCCAT CGAAAGTTCC GGCAGCTTCA CCTTCAGCGG CTTCTTCGAT GCAGAAACCA GTTACTCGGT GACCGTCCTG ACCCAGCCCA CCAGCCCCAT GCAAACCTGC TCCGTGACGA ATGGCTCTGG CGAGGTATCA GAGGATGTGA CCAACATCGT TGTCAGCTGC GAGACCGACA CCTTTACGGT GGGCGGCACC GTGAGCGGAC TTGATGCCGA AGAAAGTCTG ATCCTGCAGA ACAACGGCAC GGACAGCCTG CCAGTGACAA GCAACGGCGG ATTCGTCTTC GACACCCCCG TACAAGACCA GGCCCCCTAC AACGTCACCA TCCTCACGCT GCCCGAGAGC GGTCAGATCT GCGGCGTGAT CAACGGCATC GGCAATATCA ACGCCGCGGC TGTCGACGAC GTCGTAGTCA GTTGCGACCT GATCACCGTA TCGATGGACG TCATCAATGG CTCATTCAGC TCACAGGGCA GTCAGGTGGT CGACGGCACC CTGGTACTGC TTATCCAGCC CGAACCCGGC TACGCCCTTG TCGAAGGCAG TGTCGAGGTG GTCGGCGAAG GCTGCAGCGG CGAACTGGAC GGCAATCTCT ACACCGTGAC CCTGGGCAGC GAAGCCTGCA CGGTCACCGC GGAGTTCGAA GAGGCGCCAC TCGACGGTGC CACCTGGGGC AATTTCAACT GGGGTGAGGC CAACTGGCAG TAA
|
Protein sequence | MTSGRTLSGN YPGHPKATGM SWLTPATLLV ASLALAGCGG GGGSSNPADP VISTTAIDGA VSKGPVVGAQ VLLYLAGADG GHTGEPVAGP FTTIADGTWD DEIPEELPRP LVVIATGGSY TDEATGDTVE LGSRSLRSYL PADADTVAVT PLTELLVRVT QEQIADSPDT TIENALDAAK NTLNQALAIN FDPLTTKPLD INNAGDGDRD QRAYTAILAG LSVLANNLAP TADPFEVVQA LIDDMSDGTL DGQKAGEPVP VGDSEDTLPT TTTTDLLIAI NTAIDEADDS SAFDDVAVSE DGEGNLVISP RYTLGGTVSG LLGSGLVLQE GALELPIESS GSFTFSGFFD AETSYSVTVL TQPTSPMQTC SVTNGSGEVS EDVTNIVVSC ETDTFTVGGT VSGLDAEESL ILQNNGTDSL PVTSNGGFVF DTPVQDQAPY NVTILTLPES GQICGVINGI GNINAAAVDD VVVSCDLITV SMDVINGSFS SQGSQVVDGT LVLLIQPEPG YALVEGSVEV VGEGCSGELD GNLYTVTLGS EACTVTAEFE EAPLDGATWG NFNWGEANWQ
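
The sequences above are displayed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of that cleanup plus a GC-percentage calculation of the kind summarized in the GC content field. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the example runs only on the first two blocks of the gene sequence rather than the full record.

```python
def clean_sequence(raw: str) -> str:
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First two blocks of the gene sequence, as displayed in the record.
first_blocks = "ATGACCTCAG GTAGAACACT"
seq = clean_sequence(first_blocks)
print(seq)              # ATGACCTCAGGTAGAACACT
print(len(seq))         # 20
print(gc_percent(seq))  # 45.0
```

Passing the full gene sequence through the same two functions would give a value comparable to the reported GC content, assuming that field is computed over the gene itself.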