Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cagg_2711 |
Symbol | |
ID | 7269618 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chloroflexus aggregans DSM 9485 |
Kingdom | Bacteria |
Replicon accession | NC_011831 |
Strand | - |
Start bp | 3325221 |
End bp | 3328205 |
Gene Length | 2985 bp |
Protein Length | 994 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 643567537 |
Product | hypothetical protein |
Protein accession | YP_002464015 |
Protein GI | 219849582 |
COG category | [E] Amino acid transport and metabolism [M] Cell wall/membrane/envelope biogenesis [R] General function prediction only |
COG ID | [COG0160] 4-aminobutyrate aminotransferase and related aminotransferases [COG0739] Membrane proteins related to metalloendopeptidases [COG2334] Putative homoserine kinase type II (protein kinase fold) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0583099 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCATGGT CAACCGCCCT TGCCCGCCAC TACCACATCC ACGGCACGGT AACATCACTC CCCGGTGAAT ATGATCTCAA CCTGCTCGTT ACTGCGAGCG ATGGCATGCA GTACGTGTTG AAGGTCATGC GTCCGCACTG CGATCCGACA CTCGTTGATC TCCAGTGTGC AGCGTTGGAA CACCTCGCCG TTGTTGCGCC CGATCTTCCT GTTCCACGCT TGATCCGCAC CACCACCGGT GAGCGGTTTG TCGCCGAGCA AGATCGGCTC ATCTGGCTGA TCACCGCACT GCCGGGCGAA CACTATGCTA CCTTCCGCCC TCGCACCACC ACCTTGCACA CCGAAGTAGG CCGGCAAGCT GCTCGGCTCG ATCAGGCACT GGGTAGCTTT ACCCATCCCA CCCTGCACCG ACCACTCAAG TGGAATCTGC TTCAAGCTGA TTGGGCTACC ACCGCCCTCG CTGCCATCAC CGACCCCACC CAACGCGACC TTGCCGCTAC TGCACTTGCC GCCTACACCG ATCTGCGTCC AAACCTGCTG CAACTACCAC TTACGATTAT TCACAACGAC CTTAACGACT ACAACCTGCT CGTCATGGCC GATAGCTATG GTCATTTAAC CCTGAGTGGT ATTCTTGACT TCGGCGATTT GTGCGCTGCA CCGCGCGTCT GCGAACTCGC GATCGCGGCG GCGTATGCCC TGTTTGACCA ACCCGATCCC ATACGGACAC TAGGCGAGTT GGTTGCCGGG TACCATAGCG TCTGGCCGCT GACCGCTGCC GAGATCGATC TGATCTGGCC GCTGATCCGC ACCCGATTGG CGGTAAGCGT GGTCAACTCG GCGCTGATGA AACAACAACG CCCCGATGAC CCCTACGTAA CGATCTCGGA AGCACCGGCG TGGCGCTTGC TGGCTGCAAC CGCCCACAAT AACCCCGCAC TAGTTGCTGC CCGTTTACGC GCTATTTGCA ACTTGCCAAT ATCCGACGCT GCCGAACGGG TTATGGTCTG GCTTGATGCG CAACGTGGTC GCTTTGCACC GGTGATCGGC TGCGATCTAG CCACGGTTCC GGTGATCAGC CTTGCGGTCG GTGAAGCACC GCTGCCGCAG GATCCGTTCC AGCTCACGAC AACAGAAGCT GAAGCACTGA CCGGTAGCAA CACCGGTGTA TGCATTGGCC GTTACGGTGA GCCACGCCTT ATCTATACCG CTGCCACATT CTGGACCGCT CCCCAACCGC TGGCCGAACG CCGCACCATC CATCTCGGCG TCGATCTGTT TGCTCCGCCC GGCACTCCGG TCTGCGCTCC ACTCGATGGC GAAGTCGTGG CCGTCGAACA TCTCTCCGAC CGACTCGATT ACGGTGGTTT GGTCATCCTC AAGCACTACA CTCCAAACGG TGATCCGTTC TACACCCTCT ACGGCCATCT CGAACCAATC TGCTTCAAAC GTCTCACCGT CGGCCAGCAC ATTGCTGCCG GGCAACTGTT TGCCGCCCTC GGTATGAACA GCAACAACGG CGGATGGCAA CCACATCTCC ACTTTCAACT GATCCTACTC TACGACGCCG TTGCCGGACA CTGGCCCGGT GTTGCGGCTG CTAGTGAATG GACCTGGTGG CACGCGGTTT GCCCTAATCC GGCAGCCCTG CTGAACCTCC CGGACGAGCG CGTCGCCTAC CAACCGCTCG ATCAGGTCAG TCTCCTACAT GAACGACGCA CACACTTTGC CGGCAATTTG CGCTTGAGTT ACCGCGAACC ATGCACCTTC TTACGGGGCT GGCGTCACTA TCTATTCGAT GAGTACGGCC AAACGTATCT CGACGCCTAC AACAATGTTC CTCACGTCGG CCATGCGCAC CCGCGCATCC AAGCAGTCGC AGCGCAACAA CTTCGTCTGC TCAACACCAA CACACGCTAT CTACACCCGG CTCAGATCGC ATTTGCCCAC GAACTGCTCG CCAAACTGCC TCCATCATTG AGTGTCTGTT TCTTCGTCAA CTCCGGCTCC GAAGCCAACG AACTGGCGTT ACGGTTAGCG CGCACCTACA CCGGTGGGCA CGATATGATT ACCATCGACC ACGGCTATCA TGGCCACACC ACCGGTGCAA TCGCCATCTC GGCCTACAAG TTCAACCACC CTGCCGGCAA CGGTAAGCCC GACTGGGTAG AGGTAGTCAT GGCTCCCGAC CCCTACCGTG GCCCTTACGG CCACGATGGC CCCCGCTACG CCGCTGAAGT TGATCAAGCC ATCGAACGGA TCGCTACACG CGGTGGCAAA CTCGCCGGCT TCATCGCCGA GACCTTCCCC AGCGTCGCCG GACAGATTAT TCCTCCACCC GGCTACCTCG CTGCAGTGTA TCAACGTATT CGCGCCGCCG GCGGTGTTTG CATTGCCGAC GAAGTGCAGA CCGGATTGGG ACGACTCGGT ACGCACTATT GGGCGTTCGA GAGCCAAGGG GTAGTGCCTG ATATCGTGGT ACTCGGCAAA CCGCTCGGCA ATGGGCACCC TATCGGAGCG GTGATCACAA CCGTTGAGAT CGCCCGTGCC TTCGATAACG GCCTCGAATT CTTCTCGACG TTTGGCGGCA GTACGCTCTC TTGTGTCATC GGCCGCGAAG TGCTGCGCAT CATCGACGAA GAAGGCCTGA TGGATAACGC TCACTGCGTT GGTCAAGTCC TGCTCACCGG TCTGCGTGAC CTACAACACC GCCATCCCGT TATTAGTGAT GTGCGGGGAA TGGGCCTATT TATCGGCGTT GAGTTGGTTA CCGATCGTAC AACACGCACA CCAGCAACCG CGGCTGCTCG TTATGTGCGA GAGCGTCTAC GCGCCGAACG CATCCTCATC GGCACCGAAG GTCCAGCCGA CAACGTGTTG AAGATTCGCC CACCACTTAC CTTCGATCTC GCCGCAACCA CCGTCTTTCT GGAGCGACTC GATGCTATTC TCGGCGAGAG TTTTATTGCA CGACAGGGAG GTTAA
|
Protein sequence | MPWSTALARH YHIHGTVTSL PGEYDLNLLV TASDGMQYVL KVMRPHCDPT LVDLQCAALE HLAVVAPDLP VPRLIRTTTG ERFVAEQDRL IWLITALPGE HYATFRPRTT TLHTEVGRQA ARLDQALGSF THPTLHRPLK WNLLQADWAT TALAAITDPT QRDLAATALA AYTDLRPNLL QLPLTIIHND LNDYNLLVMA DSYGHLTLSG ILDFGDLCAA PRVCELAIAA AYALFDQPDP IRTLGELVAG YHSVWPLTAA EIDLIWPLIR TRLAVSVVNS ALMKQQRPDD PYVTISEAPA WRLLAATAHN NPALVAARLR AICNLPISDA AERVMVWLDA QRGRFAPVIG CDLATVPVIS LAVGEAPLPQ DPFQLTTTEA EALTGSNTGV CIGRYGEPRL IYTAATFWTA PQPLAERRTI HLGVDLFAPP GTPVCAPLDG EVVAVEHLSD RLDYGGLVIL KHYTPNGDPF YTLYGHLEPI CFKRLTVGQH IAAGQLFAAL GMNSNNGGWQ PHLHFQLILL YDAVAGHWPG VAAASEWTWW HAVCPNPAAL LNLPDERVAY QPLDQVSLLH ERRTHFAGNL RLSYREPCTF LRGWRHYLFD EYGQTYLDAY NNVPHVGHAH PRIQAVAAQQ LRLLNTNTRY LHPAQIAFAH ELLAKLPPSL SVCFFVNSGS EANELALRLA RTYTGGHDMI TIDHGYHGHT TGAIAISAYK FNHPAGNGKP DWVEVVMAPD PYRGPYGHDG PRYAAEVDQA IERIATRGGK LAGFIAETFP SVAGQIIPPP GYLAAVYQRI RAAGGVCIAD EVQTGLGRLG THYWAFESQG VVPDIVVLGK PLGNGHPIGA VITTVEIARA FDNGLEFFST FGGSTLSCVI GREVLRIIDE EGLMDNAHCV GQVLLTGLRD LQHRHPVISD VRGMGLFIGV ELVTDRTTRT PATAAARYVR ERLRAERILI GTEGPADNVL KIRPPLTFDL AATTVFLERL DAILGESFIA RQGG
|
| |