Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tgr7_1950 |
Symbol | |
ID | 7316339 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. HL-EbGR7 |
Kingdom | Bacteria |
Replicon accession | NC_011901 |
Strand | + |
Start bp | 2071068 |
End bp | 2072498 |
Gene Length | 1431 bp |
Protein Length | 476 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 643616843 |
Product | protein of unknown function UPF0027 |
Protein accession | YP_002514018 |
Protein GI | 220935119 |
COG category | [S] Function unknown |
COG ID | [COG1690] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00816316 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACCCGA ATCGCTTCAC GAAACTCTCC GACACCGCCT GGCAGCTTGA GCCCACGGGC AAGATGCGCG TGCCGGCGAT CCTGTACGCC AGCGAGGCGT TGCTGCGGGA GATGGACGAC AAGGTGGCCG AGCAGGCGAC CAACGTCGCC ACCCTGCCCG GCATCGTGCA GGCCAGCTAC GCCATGCCCG ACGCCCACTG GGGCTACGGC TTTCCCATCG GCGGCGTGGC CGCCTTCGAC GCAGACGCGG GCGGCGTGGT TTCGGCTGGC GGCGTAGGCT TCGACGTCTC CTGCGGCGTG CGCACCCTGC ACACGGGGCT GACCCGGGAG GCGATCGAGA AGATCAAACC GGCCCTGGCC GATGCCCTGT TCGAGTCCAT CCCGGCAGGA CTGGGCAGCA CAGGCTACAT CCACCTGCGG GACCATCAGA TGACGGAGAT GCTGGCCGGC GGCGCGGTCT GGGCGGTGCA ACAGGGCTAC GGCGAGGCGG CGGACCTGGA ACGCATCGAG GAACACGGCC GCATGGCCGG TGCCGATCCC CACGCGGTCT CGGAGCAGGC GCGCAAGCGC CAGCGCAACG AGATGGGCAC CCTGGGTTCA GGCAATCACT ATCTCGAGGT GCAGCACGTC ACCGAGATCT ACGATCCCGC CGTGGCCAAG GTGTTCGGCC TGGCCGTGGG CCAGGTGGTG GTGAGCATCC ATTGCGGCTC CCGGGGCCTG GGCCACCAGA TCGGCACCGA GTTCCTGCGC GAGATGGCGG TGGCGGCGAA CCGCCACGGC ATCGAGCTGC CGGACCGGGA ACTGGCCTGC GCGCCCATCC GCTCGGAACT GGGCGAGCGC TACCTGGGCG CCATGCGCTC GGCCATCAAC TGCGCGCTGG CCAACCGCCA GATCCTCACC CACCTGACCC GGCGCGTGTT CGCGAAGGTC CTGCCCGAGG CGCGCCTGGA CCTGCTCTAC GACGTCTCCC ACAACACCTG CAAGGTGGAG ACCCACAGCA TCGACGGCAG CCCTCGCCAG CTCTACGTGC ACCGCAAGGG CGCCACCCGC GCCTTCGGCC CCGGCCACCC GGACCTGCCC GACGCCCTGC GCCCGGTGGG CCAGCCGGTG CTGATCGGCG GCTCCATGGG CACGGCCTCC TACATCCTGG TGGGCACCAA CGAGGGCGAA CGGCTGTCCT TCAACTCCGC CTGCCACGGC GCGGGCCGGG CCATGAGCCG GCATGCCGCG ACCCGCCAGT GGCGCGGCCG CGCGCTGGTG GATGAGCTGG CCGGGCGCGG CATCCTGATC CGCAGCCCCA GCCTGCGCGG CGTGGCCGAG GAGGCGCCCG GGGCGTACAA GGACGTGAGC GAGGTGGTGA AGGCGACCCA CCAGGCGGGC CTGGCGAGGA TGGTGGCGCG GGTGGAGCCG TTGGTGTGCA TCAAGGGGTA G
|
Protein sequence | MDPNRFTKLS DTAWQLEPTG KMRVPAILYA SEALLREMDD KVAEQATNVA TLPGIVQASY AMPDAHWGYG FPIGGVAAFD ADAGGVVSAG GVGFDVSCGV RTLHTGLTRE AIEKIKPALA DALFESIPAG LGSTGYIHLR DHQMTEMLAG GAVWAVQQGY GEAADLERIE EHGRMAGADP HAVSEQARKR QRNEMGTLGS GNHYLEVQHV TEIYDPAVAK VFGLAVGQVV VSIHCGSRGL GHQIGTEFLR EMAVAANRHG IELPDRELAC APIRSELGER YLGAMRSAIN CALANRQILT HLTRRVFAKV LPEARLDLLY DVSHNTCKVE THSIDGSPRQ LYVHRKGATR AFGPGHPDLP DALRPVGQPV LIGGSMGTAS YILVGTNEGE RLSFNSACHG AGRAMSRHAA TRQWRGRALV DELAGRGILI RSPSLRGVAE EAPGAYKDVS EVVKATHQAG LARMVARVEP LVCIKG
|
| |