Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | HY04AAS1_0456 |
Symbol | |
ID | 6743250 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Hydrogenobaculum sp. Y04AAS1 |
Kingdom | Bacteria |
Replicon accession | NC_011126 |
Strand | - |
Start bp | 396361 |
End bp | 397749 |
Gene Length | 1389 bp |
Protein Length | 462 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 642750249 |
Product | aromatic hydrocarbon degradation membrane protein |
Protein accession | YP_002121124 |
Protein GI | 195952834 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG2067] Long-chain fatty acid transport protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.00000969908 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAGA GGCTTCTTGC TTTGGCGGTG TTTACAGGTT TAATTGGAGT GTGGCAAACA GGTGCTTTTG CCACAAATGG CGATGATCTT ATAGGCGTTA CACCAAACTC AGAGGCAATG GGTGGCATAG GTGTTGGTAT GCCTGTGGGT TCTGTGGACT CTATCTTTAG AAACCCAGCT TGGATGAGCG TAGCAAGATC AAACAAAGTC CAGTTTGGCG GTATGCTTTT TATGCCAAGC GTAAAAGCTG AAAGCAATAA TGCATTTATA GGCACTTCTG GTGCACCATC ATCTGCTTCT GCTGATAGCG ACGCAAACCT CTTTTTAGTG CCAGAGATAG GTATAGTGGA CAAGATAAAC GATAAACTTG TATTTGGTAT AGGTGCTTTT GGTGTATCTG GTATGGGTAC AGATTACAAC GGTAAGGGTC CGCAGCAGAT TATAGGGCAA AACGCTCAAG GCCCAGTTAC AATTCCAGCT TTTTACAACA TGAGAACAAC GCTTCAGTTT ATGCGTATAA TACCAGCCTT ATCTTACCAG ATAAACCCAA TGATAAGCGT AGGTGCCGGT ATAGATTTTG CCTACGGATC ACTTGATATG AACGCTACTA TGCCAAATGA ATGTACACCT TTAATGCAAC CTCCTTATCT CTATTGTACC AGCAAAGCAT CTTACGGTGG GGGTCAAAGT TCGGCACTTG GTATAGGTGG TCAGCTTGGT GTAGCCTTTA ACTTCGGGAA TTTTGTATAC GCTGGTTTAA ACTATCAAAC ACCTATATCT ATGACATATA GACATGTGTT TGATTTTCAA GGAGCATCCC ATTATAATTT TACTGGAAGT TTCCAAGATC TTAAACTTGA ACAACCTCAA GAGCTTGCTT TTGGTGTAGG TATAGCACCC ACTAACAAAT GGAACGTAGG TGCTGATGTT AGATGGATAA ACTGGTCAAA TGCCGATGGT TACAAAGACT TTGGCTGGAA AGACCAATGG GTATTTGCTC TTGGTACATC TTACAAGCTA ACACCAAAAC TCACATTAAG AGCTGGTTTT AACTATGCAA GAAGCCCTAT AAGAAGCAAC AATTTTGGAT CAAACATGAG TCAAACGCCA GCCGCCTCTA TAAGCGGAGC CCAGTTTAAT CAATATGCTA TAGACTTCTT TAACCTATAC GGATTCCCAG CTATTACAGA AGAAGCTATA ACGCTTGGTG GAAGCTATCA GTTTACAAAC ACCTTTGGTG TATCGCTTGC TTATGAGCAT GATTTCCAAC ATAGCGTAAC GGATAGTGGA TATTGTTATG ATGGTTTAAA TAACACTAAT AATCCATGCT CTATAGGTGC AAAAAACGCA GAAGATGCCA TAAACGTAGC TCTTGAATGG AGATTTTGA
|
Protein sequence | MKKRLLALAV FTGLIGVWQT GAFATNGDDL IGVTPNSEAM GGIGVGMPVG SVDSIFRNPA WMSVARSNKV QFGGMLFMPS VKAESNNAFI GTSGAPSSAS ADSDANLFLV PEIGIVDKIN DKLVFGIGAF GVSGMGTDYN GKGPQQIIGQ NAQGPVTIPA FYNMRTTLQF MRIIPALSYQ INPMISVGAG IDFAYGSLDM NATMPNECTP LMQPPYLYCT SKASYGGGQS SALGIGGQLG VAFNFGNFVY AGLNYQTPIS MTYRHVFDFQ GASHYNFTGS FQDLKLEQPQ ELAFGVGIAP TNKWNVGADV RWINWSNADG YKDFGWKDQW VFALGTSYKL TPKLTLRAGF NYARSPIRSN NFGSNMSQTP AASISGAQFN QYAIDFFNLY GFPAITEEAI TLGGSYQFTN TFGVSLAYEH DFQHSVTDSG YCYDGLNNTN NPCSIGAKNA EDAINVALEW RF
|
| |