Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acry_2097 |
Symbol | |
ID | 5161881 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Acidiphilium cryptum JF-5 |
Kingdom | Bacteria |
Replicon accession | NC_009484 |
Strand | + |
Start bp | 2319702 |
End bp | 2322776 |
Gene Length | 3075 bp |
Protein Length | 1024 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640554019 |
Product | hypothetical protein |
Protein accession | YP_001235215 |
Protein GI | 148261088 |
COG category | [S] Function unknown |
COG ID | [COG4995] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGCCGAA CGGTCCTGGT TCTGGCCGCC CTTGCCGGCC TGGCGCTGTT GAGCGCGTGC ACGCGGCCGC CGCCGGGCGC CTATGCGCAT GACACGGATT CCCCGCAATC GGCTAGCGGC CGGCAACTCG CGCTGGGCAA AAATACCAGC GGCGAAACCT GCACCGCCAG CCGCACGACC TCCGGCGGCG CATTGATCTA CTGCGGCAGT TGGCACCAGC CCAGCGCCGA AGTGCATGCG GGTCCGGAGG CGAACCGGGC GAACCTGAAT GCAATCGTTA CGCGAGGAAA CTGGCGGGCC GGTCTCGAAC AGCGCTACGC GTGCGGCAAT CCAAAGCCGA CAACGGTGCT CGGGCACTAC CCTGGAGAGG TTCTGCTCTG CACGCAGCGG ATCGGCGGCT GGCCGCATGT CGCCCTGGCA GCCTTGATCC GCGGCCATGT GTGGTTTGCC GATGGCGTGC AACCGGCTTT TCCAGCCATG GAACGCGCCG TCGGCGTGAT CAGCGGCGCA GTGCCAGCCG CCGATGCCGG CGCGATGACC ACTTCGGTTT CCGACAGCCA GCTTGCCGAC CGCCTCGCCG CCGCCGCCTT CACCTCGGGT GATATCCACG AGTACGGCAA GTTGATGACG CTCGGCAACC GCTCCAACCA GGCCGAGGAT TACCCTGCCG CCATCACCGC CTACCGGGCC GCGCTCGCGC TGCAGCAAAA GGCGCTTGGG CCGGAAAATC CGAATACGGT GACGCCGATG CTGGATCTTG CCCTGAATCT CTCGGATCAG GGCGATTATA GCCAGGCTGA CGCCCTGCTT GCGCAGGCGG CGAAGCTGGC TCCCCTGGCC GTGGATCCCA CGGCGGTCGC GCGACTCGAC CATTATCGGG GTCTCGATCA ACTCAATCAG GGTCATGATC GCCGTGCGCT GCGGCTGCTC TCGGCCGCAA ACCGGGATTA TGCATCGCTT CTGCCCCGCT CCGTGTTGCG CCCCCACCCT GCCGCGACCG GGTCCGACGA TGGCTTCGGC CTTTCGGCCG GAAGCCGGGG GGGATCGCTC CTCGCCGCGC AGTCGACCCT GTTGAGCCCG ATCGGGCAGC AGGCCCTGCT CGGCGTGATC GAAACACTGC GGTATGAGGG CGTGGTGCTC GATGCGGCCG GGCATCACAA GCAGGCCGCC GTGAAAATTG ACCGTGCATC GGCGATCGCT TCGGCGAACG GGATTGCTCC TCCCGTGCTT CGCGCCAGGC TTGACCGGAG CACGGCCGCC GTCGACACGG CGTTTGACAG GACCGCCTCG GCTGCCGCCA GGCTGGCGCA AGCCTCGTAT GATTTCGAAC ATGCCCTGCC CGGTTCGCGC CCGGTCGCCG ACACGCACCT GCTGCGCGGC GGCGTGCTCG ACCTCGCCGG TCAACCCGGA AAGGCCCTGG AGGCTTGCCG GGCCGGCATA AAGCTTCTCG CCCGCCTGCG CCTCGGCACC TCTGCGGCCC TGGTTGCTCC GTGCCTCGAC GCGGCCAATC GGGAAGCGGA GAAGAATCCT GCCCGGGCCG GGGTGCTTCG CGCCGAGATG TTCGCGATGG CGGAACTCGC GCAAGGCAGC ACGACCAGCC GTGAAATCGC CGAGTCCGCC GCGCGGCTGG CCGCGGATTC AAAGAGCCCC AAAGTCGCCG CCGCGATCCG CGCCCATCAG GATGCCGTCA TCGCCCTGTC GCGCCTCTAC CGGGAGCGTG ACGGCATCGC CCATGAACGG CAGGCAACGC CGGCCGCCAT CAACGCAATC GACATGAAGA TCGCGGCGGC AACCCGGCGG CTCGCCGAAG CGGACGAGTC CGTCGAGGCG GCGGCACCGA ATTTCGGCCA GCTGGTCCAG CAGGTCGTGC CCGCCGGCAC CGTGCTCGAC CGGCTGAGGC CTGGCGAAGC CTTTCTCGAT ATCATGCCGG CGCCGGACGG CACGTGGACG TTCCTGTTGC ACGACGGGCG GATCGCCGTC GCGCACACGA AGGTCGACGA GACGCGCATG ACCGCACTGG TCCGCAAAGT GCGCCAAAGC GTCGTGCCGA CGCAGGCGGG ATTGCCGACC TTCGCGATGA CCTCTGCCGA AGCGATCTAT CGCGCGACCC TGGCCCCCTT CGCGACCGAC CTCGCGAAGA CCACGGCGAT GGTGGTTTCA CCGGCCGGGG CGCTGCTGTC GCTGCCATTC GCGCTGCTGC CCACCGAGCC GGTCGCCCCC GGCACGCCGC TGGCCAAGGT GCCCTGGCTG ATCAGGAAGA TGACGCTGGT CTACGTTCCC GCGGCCGCGA ATTTCGTCTC GCTGCGCGCC ATCGCCGGCA CGTCGCCCGC CAAGCAGCCA TGGTTTGGCT TCGGGGACTT CAAGAACACG AGCCTGGCCC AGGCCGAGGC CACGTTCAGT GGCCCCAGTT GCGGCGACAG CGCCCGGCTC TTCGCCGGCC TGCCCCACCT GCCGTATGCG AAGCTCGAAC TTGACGCCGC CCGCGCGATC TTCAAGGCAC CCGCATCGGA CGAGCTGCTG GGGGCGGCCT TCACGGTCCC GAATGTCGAG CACGCCGACC TGAAGCAATA CCGGATCCTG CATTTCGCGA CCCACGCGCT GCTGCCCTCC GAACTGCCCT GCGCTCATGA ACCGGCGATC GTCACCTCGC CGCCGCCCGG CGCCAGGTCC GCGGCGAACT CGATGCTGAC GACCTCCGAC ATCACCAATC TCAAACTGAA CGCCGACCTC GTGATCCTGT CGGCATGCAA TACGGGCGGG GGCGACGGAA AGGCCGGCGG TGAAGCGCTT TCCGGCCTTG CCCGCGCGTT CTTCTTCGCC GGCGCCCGCG CGCTGATGGT CACGCAATGG TCGGTGAACG ACCAGGTCAG CTCCTATCTG GTCGCAACCA CGCTCACCCA TCTGGCCTCA TCGACGGGCG AGGGGGCGGC CGCGAGCCTG CGGAGCGCCC AGCTCGACCT GATCAGGGGC GCCGCGTCCG GCACCCTGCC GGCCAAGCTC GCCGATCCGT TCTTCTGGGC GCCTTTCGTG GTCATCGGCG ACGGTGGACA GGGCACGCGC AATCTGGCCA AGTAG
|
Protein sequence | MRRTVLVLAA LAGLALLSAC TRPPPGAYAH DTDSPQSASG RQLALGKNTS GETCTASRTT SGGALIYCGS WHQPSAEVHA GPEANRANLN AIVTRGNWRA GLEQRYACGN PKPTTVLGHY PGEVLLCTQR IGGWPHVALA ALIRGHVWFA DGVQPAFPAM ERAVGVISGA VPAADAGAMT TSVSDSQLAD RLAAAAFTSG DIHEYGKLMT LGNRSNQAED YPAAITAYRA ALALQQKALG PENPNTVTPM LDLALNLSDQ GDYSQADALL AQAAKLAPLA VDPTAVARLD HYRGLDQLNQ GHDRRALRLL SAANRDYASL LPRSVLRPHP AATGSDDGFG LSAGSRGGSL LAAQSTLLSP IGQQALLGVI ETLRYEGVVL DAAGHHKQAA VKIDRASAIA SANGIAPPVL RARLDRSTAA VDTAFDRTAS AAARLAQASY DFEHALPGSR PVADTHLLRG GVLDLAGQPG KALEACRAGI KLLARLRLGT SAALVAPCLD AANREAEKNP ARAGVLRAEM FAMAELAQGS TTSREIAESA ARLAADSKSP KVAAAIRAHQ DAVIALSRLY RERDGIAHER QATPAAINAI DMKIAAATRR LAEADESVEA AAPNFGQLVQ QVVPAGTVLD RLRPGEAFLD IMPAPDGTWT FLLHDGRIAV AHTKVDETRM TALVRKVRQS VVPTQAGLPT FAMTSAEAIY RATLAPFATD LAKTTAMVVS PAGALLSLPF ALLPTEPVAP GTPLAKVPWL IRKMTLVYVP AAANFVSLRA IAGTSPAKQP WFGFGDFKNT SLAQAEATFS GPSCGDSARL FAGLPHLPYA KLELDAARAI FKAPASDELL GAAFTVPNVE HADLKQYRIL HFATHALLPS ELPCAHEPAI VTSPPPGARS AANSMLTTSD ITNLKLNADL VILSACNTGG GDGKAGGEAL SGLARAFFFA GARALMVTQW SVNDQVSSYL VATTLTHLAS STGEGAAASL RSAQLDLIRG AASGTLPAKL ADPFFWAPFV VIGDGGQGTR NLAK
|
| |