Gene Information |
Locus tag | Tgr7_0977 |
Symbol | |
ID | 7315008 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. HL-EbGR7 |
Kingdom | Bacteria |
Replicon accession | NC_011901 |
Strand | + |
Start bp | 1055005 |
End bp | 1058223 |
Gene Length | 3219 bp |
Protein Length | 1072 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 643615862 |
Product | carbamoyl-phosphate synthase, large subunit |
Protein accession | YP_002513052 |
Protein GI | 220934153 |
COG category | [E] Amino acid transport and metabolism [F] Nucleotide transport and metabolism |
COG ID | [COG0458] Carbamoylphosphate synthase large subunit (split gene in MJ) |
TIGRFAM ID | [TIGR01369] carbamoyl-phosphate synthase, large subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.270817 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCTAAGC GTACAGACAT CAACAGCATC CTCATCATCG GCGCCGGCCC GATCATCATC GGCCAGGCCT GTGAGTTCGA CTACTCCGGC GCCCAGGCCT GCAAGGCGCT GCGCGAGGAG GGCTACCGGG TCATCCTGGT GAACTCCAAC CCGGCCACCA TCATGACCGA CCCGGAGATG GCGGACGCCG TCTACATCGA GCCGGTGGAA TGGAAGACCG TGGCGCGCAT CATCGAGCGT GAGAAGCCGG ATGCCATCCT GCCCACCATG GGCGGACAGA CGGCGCTCAA CTGCGCCCTG GATCTGGCGA AGCACGGCGT GCTCAAGAAG CACAACGTGG AGCTGATCGG CGCCAGCGAG GACGCCATCG ACATGGCCGA GGACCGGGAG CGCTTCCGCG ACGCCATGAC CGAGATCGGT CTCGAGTCGC CCATGGCCTT CGTGGTCAAC ACCTGGGATG AGGCCCAGGA GGTGCAGCCG AAGATCGGCT TCCCGGTGAT CATCCGCCCG TCCTTCACCA TGGGCGGCAG CGGCGGCGGC ATCGCCTACA ACCGCGACGA GTACGAGGAG ATCGTCAAGC GCGGCCTGGA CCTCTCGCCC ACCAACGAGG TGCAGGTGGA GGAGTCCGTG CTGGGCTGGA AGGAATACGA GATGGAGGTG GTGCGCGACA AGGCGGACAA CTGCATCATC ATCTGCTCCA TCGAGAACCT GGACCCCATG GGCATCCACA CCGGTGACTC CATTACCGTG GCCCCGGCCC AGACGCTCAC CGACAAGGAA TACCAGATCA TGCGCAACGC CTCCATCGCG GTGCTGCGCA AGATCGGCGT GGATACCGGC GGTTCCAACG TGCAGTTCGC CGTGAACCCG GACGACGGCC GCCTGGTGAT CATCGAGATG AACCCGCGTG TGTCGCGCTC CTCGGCCCTG GCCTCCAAGG CCACCGGCTT CCCCATCGCC AAGGTGGCGG CCAAGCTGGC CGTGGGCTAC ACCCTGGATG AGCTGCGCAA CGAGATCACC GGCGGCGTCA CACCCGCGTC CTTCGAGCCG TCCATCGACT ACGTGGTCAC CAAGGTGCCG CGTTTCACCT TCGAGAAATT CCCCCAGGCC GAGGCCAAGC TCACCACCCA GATGAAGTCG GTGGGCGAGG TGATGGCCAT CGGGCGCTGC TTCCAGGAGT CCTTCCAGAA GGCCCTGCGG GGTCTTGAGA CCGGTGCCGA CGGCCTTAAC CCCATGGTGG ATCTGAACGA CCCGGACGCG GAGATGCTGG TGCGCCAGCA GCTGGTGGCC AACGGTCCGG ATCGCATCTG GTACGTGGGC GATGCCTTCC GCCTGGGGAT GAGTATTGAG CAGGTCCACG AGCTGACCCG CATCGACCGC TGGTTCCTGG CCCAGATCAA CGAGCTGATC TTCACCGAAC AGCAACTCCA GGGCCGCAGC CTCAAGGACC TGGACCGCCT GGCCCTGCTG GATCTCAAGC GCCGGGGCTT CTCAGATCGT CGCCTGGCCA CCCTGCTGGG CGAGAAGGAG CAGGCGGTGC GCGAGCATCG CCATGGTCTG AAGGTGCGCC CGGTGTACAA GCGGGTGGAC ACCTGCGCCG CGGAGTTCGC CTCCAACACC GCCTACATGT ATTCCAGCTA CGACGAGGAG TGCGAGGCCG CACCCTCCAG TCGCGAGAAG ATCATGGTGC TGGGCGGCGG CCCCAACCGC ATCGGTCAGG GCATCGAGTT CGACTACTGC TGCGTGCACG CGGCGCTCGC CATGCGCGAG GATGGCTATG AGACCATCAT GGTCAACTGC 
AACCCGGAGA CCGTGTCCAC CGACTACGAC ACCTCCGATC GCCTGTACTT CGAGCCCCTG ACCCTGGAAG ACGTGCTGGA GATCGTGGCG GTCGAGAAGC CCAGGGGCGT GATCGTGCAG TACGGTGGCC AGACCCCGCT CAAGCTCGCC CGGGATCTCG AGGCCGCCGG CGTGCCCATC ATCGGCACCA CACCCGACTC CATTGATCTC GCCGAGGACC GGGAGCGCTT CCAGCAGCTC ATCGACAGGC TGGAACTGGT GCAGCCCCCC AACCGCACCG CACGCAGCGT GGAGGACGCC ATCAGCAAGG CCGAGGAGAT CGGCTATCCG GTGGTGGTCC GTCCCTCCTA CGTGCTGGGC GGCCGGGCCA TGGAGATCGT CTACAACGAG GAGGACCTGC GCCGCTACAT GCGCAGCGCC GTGCAGGTCT CCAACGATGC CCCCGTGCTG CTGGACCGGT TCCTGGATGA CGCCGTGGAG GTGGACGTGG ATGCCATCTG CGACGGCGAA GAGGTGCTCA TCGGTGGCGT CATGCAGCAC ATCGAACAGG CCGGCGTGCA TTCCGGCGAC TCCGCCTGCT CCCTGCCGCC GTACTCCCTG GACAAGAAGA TCCAGGACGA GCTGCGCGAG CAGGTGCGCA AGATGGCCTT CGGCCTCAAG GTGGTGGGTC TGATGAACAC CCAGTTCGCG GTGCAGGGCG ATAAGGTCTA CGTGCTGGAG GTCAATCCCC GCGCCTCGCG CACCGTGCCC TTCGTCTCCA AGGCCATCGG CAAGCCCCTG GCCAAGATCG CCGCCCGCTG CATGGCCGGG CGCAGCCTCA GGGAGCAGGG CGTGGACGGC GAGGTGATCC CGTCCTACTT CTCCGTGAAG GAAGCGGTGT TTCCGTTCAT CAAGTTCCCC GGCGTGGACA CCATCCTGGG GCCGGAGATG AAGTCCACCG GCGAGGTTAT GGGTATGGGC CGCAGCTTCG GCGAGGCCTT CGCCAAGAGC CAGCTGGGCG CGGGCGTGAC CCTGCCCCAG CGCGGCAAGG CCTTCGTCAG CGTGCGCGAG GCGGACAAGG GCGGTGTGGC CAAGGTGGGC CGTGCGCTGG TGGAGCAGGG CTTCGAGGTC ATCGCCACCC GCGGCACCGC CGAGGTCCTC AAGGCGGCGG GCATCGAATG CACGCCAGTC AACAAGGTCA CCGAGGGCCG CCCGCACATC GTGGACATGA TCAAGAACGA CGAGATCAGC CTGATCGTGA ACACCACCGA GGGCCGGCAG GCCATCGCGG ATTCATTCAC CATCCGCCGC GAGGCCCTGC AGCACAAGGT CAGCTACACC ACGACCCTGG CCGGCGCGCT GGCCACGGTG TCGGCCTTGA GCTACCTGGC CCACGAGGAA GTCTACAGAT TGCAGGACAT GCATCAGGAG AATCGGTAA
|
Protein sequence | MPKRTDINSI LIIGAGPIII GQACEFDYSG AQACKALREE GYRVILVNSN PATIMTDPEM ADAVYIEPVE WKTVARIIER EKPDAILPTM GGQTALNCAL DLAKHGVLKK HNVELIGASE DAIDMAEDRE RFRDAMTEIG LESPMAFVVN TWDEAQEVQP KIGFPVIIRP SFTMGGSGGG IAYNRDEYEE IVKRGLDLSP TNEVQVEESV LGWKEYEMEV VRDKADNCII ICSIENLDPM GIHTGDSITV APAQTLTDKE YQIMRNASIA VLRKIGVDTG GSNVQFAVNP DDGRLVIIEM NPRVSRSSAL ASKATGFPIA KVAAKLAVGY TLDELRNEIT GGVTPASFEP SIDYVVTKVP RFTFEKFPQA EAKLTTQMKS VGEVMAIGRC FQESFQKALR GLETGADGLN PMVDLNDPDA EMLVRQQLVA NGPDRIWYVG DAFRLGMSIE QVHELTRIDR WFLAQINELI FTEQQLQGRS LKDLDRLALL DLKRRGFSDR RLATLLGEKE QAVREHRHGL KVRPVYKRVD TCAAEFASNT AYMYSSYDEE CEAAPSSREK IMVLGGGPNR IGQGIEFDYC CVHAALAMRE DGYETIMVNC NPETVSTDYD TSDRLYFEPL TLEDVLEIVA VEKPRGVIVQ YGGQTPLKLA RDLEAAGVPI IGTTPDSIDL AEDRERFQQL IDRLELVQPP NRTARSVEDA ISKAEEIGYP VVVRPSYVLG GRAMEIVYNE EDLRRYMRSA VQVSNDAPVL LDRFLDDAVE VDVDAICDGE EVLIGGVMQH IEQAGVHSGD SACSLPPYSL DKKIQDELRE QVRKMAFGLK VVGLMNTQFA VQGDKVYVLE VNPRASRTVP FVSKAIGKPL AKIAARCMAG RSLREQGVDG EVIPSYFSVK EAVFPFIKFP GVDTILGPEM KSTGEVMGMG RSFGEAFAKS QLGAGVTLPQ RGKAFVSVRE ADKGGVAKVG RALVEQGFEV IATRGTAEVL KAAGIECTPV NKVTEGRPHI VDMIKNDEIS LIVNTTEGRQ AIADSFTIRR EALQHKVSYT TTLAGALATV SALSYLAHEE VYRLQDMHQE NR
|
| |
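The length fields in this record follow from its coordinates: Start bp and End bp are inclusive, so the gene length is End - Start + 1, and the protein length is the number of codons minus the terminal stop codon (the gene sequence above ends in TAA). The sketch below checks that arithmetic. The function names are hypothetical helpers written for illustration and are not part of any database API. The GC helper assumes the sequence is pasted as plain text and ignores the spaces between 10-base blocks.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive 1-based start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # The stop codon is part of the gene but encodes no amino acid.
    return codons - 1 if has_stop_codon else codons


def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases, ignoring whitespace and non-ACGT characters."""
    bases = [b for b in sequence.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotides found")
    gc = sum(1 for b in bases if b in "GC")
    return gc / len(bases)


# Values taken from the record above.
length_bp = gene_length(1055005, 1058223)
print(length_bp)                  # 3219
print(protein_length(length_bp))  # 1072
```

To compare against the GC content field, pass the full gene sequence string to `gc_fraction` and multiply the result by 100.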