Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlg_0567 |
Symbol | |
ID | 4270897 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | - |
Start bp | 614779 |
End bp | 615834 |
Gene Length | 1056 bp |
Protein Length | 351 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 638125309 |
Product | putative iron-sulfur cluster binding protein |
Protein accession | YP_741411 |
Protein GI | 114319728 |
COG category | [C] Energy production and conversion |
COG ID | [COG1600] Uncharacterized Fe-S protein |
TIGRFAM ID | [TIGR00276] iron-sulfur cluster binding protein, putative |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.000000095746 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCAGGCGC TGGCCGAACG CATCCGCGTC TGGGCGGGCG AGCTGGGGTT CACCGGCGTG GGTATCGCCG ACCCGGACCT GCGCCAGGAT GAGTACTGGT TGCTGCGCTG GCTGCGCCGG GGCTGGCAGG GGACCATGGG CTGGATGGGG CGCCACGGGG TGAAGCGCAG CCGACCGCAG CGGCTGCTGC CGGGCACGGG GCGGATCATC TCGGTGCGGC TGGACTACCA GCCGGCGGGG GCGGAGCCCT GGTCGGTGCT GGCGGACGGG CGCAAGGCCT ATGTCGCCCG CTACGCCCTG GGGCGGGACT ATCACAAGCT GATGCGCCAG CGGCTGCAGA AACTCGCCCG GCGGATCGAG ACCGAGGTGG GGCCGTACGG TTACCGCGCC TTTGTGGACA GCGCCCCGGT GCTGGAGAAG GCGGTGGGCC GGGAGGCGGA CCTGGGCTGG ATCGGCAAGC ACACGCTGTT GATGGACCGG GACGCCAGCT CGTGGTTCTT CCTGGGGGAG CTGTTCACGG ACCTGCCCCT GCCCGCCGAC CCGCCACGGC GCCGCGGCCA CTGCGGCCGG TGCCGCGCCT GTATCGATGT CTGCCCGACG GGGGCCATCG TCGGCCCCTA CCAACTCGAT GCCCGCCTCT GCATCAGTTA CCTCACCATC GAACACGACG GCCCGATCCC GGAGCCGCTG CGGCCGCTGA TGGGCAACCG GGTGTTCGGC TGCGACGACT GCCAGCTCAT CTGCCCGTGG AACAAGTTCG CCCGGCCGAC GGCGGAGGGG GACTTCCAAC CGCGGCACAA CCTGGACCAC GCGGACCTGG TGGAACTGTT CGGCTGGACC GAATCGCAGT TTCTCGACCG GATGGCGGGC TCGGCGATCC GTCGCCTGGG GCACGAGCGG TGGCTGCGCA ACCTCGCCGT AGCCTTGGGC AACGGGCCGG CCAGCGCCGA GGCCGTGGCG GCGCTGGAGG CGCGACAGGA GCACCCGTCA GCCCTGGTGC GCGAGCATGT GGCGTGGGCC TTGAGACGGT TGACGGAACC CGGAAACGCG GAATAA
|
Protein sequence | MQALAERIRV WAGELGFTGV GIADPDLRQD EYWLLRWLRR GWQGTMGWMG RHGVKRSRPQ RLLPGTGRII SVRLDYQPAG AEPWSVLADG RKAYVARYAL GRDYHKLMRQ RLQKLARRIE TEVGPYGYRA FVDSAPVLEK AVGREADLGW IGKHTLLMDR DASSWFFLGE LFTDLPLPAD PPRRRGHCGR CRACIDVCPT GAIVGPYQLD ARLCISYLTI EHDGPIPEPL RPLMGNRVFG CDDCQLICPW NKFARPTAEG DFQPRHNLDH ADLVELFGWT ESQFLDRMAG SAIRRLGHER WLRNLAVALG NGPASAEAVA ALEARQEHPS ALVREHVAWA LRRLTEPGNA E
|
| |