Gene Mlg_0567 details

Gene Information

Locus tag: Mlg_0567
Symbol:
ID: 4270897
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 614779
End bp: 615834
Gene Length: 1056 bp
Protein Length: 351 aa
Translation table: 11
GC content: 71%
IMG OID: 638125309
Product: putative iron-sulfur cluster binding protein
Protein accession: YP_741411
Protein GI: 114319728
COG category: [C] Energy production and conversion
COG ID: [COG1600] Uncharacterized Fe-S protein
TIGRFAM ID: [TIGR00276] iron-sulfur cluster binding protein, putative
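
The start, end, gene length, and protein length fields in this record agree with each other if the coordinates are read as 1-based and inclusive, and if the final codon is a stop codon that is not counted in the protein. The record does not state its coordinate convention, so that reading is an assumption. A minimal Python sketch of the arithmetic, with hypothetical helper names:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa, assuming the CDS ends with one uncounted stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(614779, 615834)
length_aa = protein_length(length_bp)
print(length_bp, "bp;", length_aa, "aa")
```

With the values from this record, the sketch gives 1056 bp and 351 aa, matching the Gene Length and Protein Length fields.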


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.000000095746
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGCAGGCGC TGGCCGAACG CATCCGCGTC TGGGCGGGCG AGCTGGGGTT CACCGGCGTG 
GGTATCGCCG ACCCGGACCT GCGCCAGGAT GAGTACTGGT TGCTGCGCTG GCTGCGCCGG
GGCTGGCAGG GGACCATGGG CTGGATGGGG CGCCACGGGG TGAAGCGCAG CCGACCGCAG
CGGCTGCTGC CGGGCACGGG GCGGATCATC TCGGTGCGGC TGGACTACCA GCCGGCGGGG
GCGGAGCCCT GGTCGGTGCT GGCGGACGGG CGCAAGGCCT ATGTCGCCCG CTACGCCCTG
GGGCGGGACT ATCACAAGCT GATGCGCCAG CGGCTGCAGA AACTCGCCCG GCGGATCGAG
ACCGAGGTGG GGCCGTACGG TTACCGCGCC TTTGTGGACA GCGCCCCGGT GCTGGAGAAG
GCGGTGGGCC GGGAGGCGGA CCTGGGCTGG ATCGGCAAGC ACACGCTGTT GATGGACCGG
GACGCCAGCT CGTGGTTCTT CCTGGGGGAG CTGTTCACGG ACCTGCCCCT GCCCGCCGAC
CCGCCACGGC GCCGCGGCCA CTGCGGCCGG TGCCGCGCCT GTATCGATGT CTGCCCGACG
GGGGCCATCG TCGGCCCCTA CCAACTCGAT GCCCGCCTCT GCATCAGTTA CCTCACCATC
GAACACGACG GCCCGATCCC GGAGCCGCTG CGGCCGCTGA TGGGCAACCG GGTGTTCGGC
TGCGACGACT GCCAGCTCAT CTGCCCGTGG AACAAGTTCG CCCGGCCGAC GGCGGAGGGG
GACTTCCAAC CGCGGCACAA CCTGGACCAC GCGGACCTGG TGGAACTGTT CGGCTGGACC
GAATCGCAGT TTCTCGACCG GATGGCGGGC TCGGCGATCC GTCGCCTGGG GCACGAGCGG
TGGCTGCGCA ACCTCGCCGT AGCCTTGGGC AACGGGCCGG CCAGCGCCGA GGCCGTGGCG
GCGCTGGAGG CGCGACAGGA GCACCCGTCA GCCCTGGTGC GCGAGCATGT GGCGTGGGCC
TTGAGACGGT TGACGGAACC CGGAAACGCG GAATAA
 
Protein sequence
MQALAERIRV WAGELGFTGV GIADPDLRQD EYWLLRWLRR GWQGTMGWMG RHGVKRSRPQ 
RLLPGTGRII SVRLDYQPAG AEPWSVLADG RKAYVARYAL GRDYHKLMRQ RLQKLARRIE
TEVGPYGYRA FVDSAPVLEK AVGREADLGW IGKHTLLMDR DASSWFFLGE LFTDLPLPAD
PPRRRGHCGR CRACIDVCPT GAIVGPYQLD ARLCISYLTI EHDGPIPEPL RPLMGNRVFG
CDDCQLICPW NKFARPTAEG DFQPRHNLDH ADLVELFGWT ESQFLDRMAG SAIRRLGHER
WLRNLAVALG NGPASAEAVA ALEARQEHPS ALVREHVAWA LRRLTEPGNA E
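
The gene and protein sequences above can be cross-checked against each other, and against the reported GC content, with a short script. The following is a minimal Python sketch, not the database's own method. The names `clean`, `translate`, and `gc_fraction` are hypothetical. Translation table 11 assigns the same amino acids to codons as the standard code (the two differ only in permitted start codons), so the compact standard table is used here.

```python
BASES = "TCAG"
# Amino acids for codons in TCAG order (first, second, third base); '*' is stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-letter blocks and uppercase."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base blocks and the final 15 bases of the gene sequence.
first_blocks = "ATGCAGGCGC TGGCCGAACG CATCCGCGTC"
last_bases = "GGAAACGCG GAATAA"
print(translate(first_blocks))  # compare with the start of the protein sequence
print(translate(last_bases))    # compare with the end of the protein sequence
```

Passing the full gene sequence to `translate` should reproduce the protein sequence listed above, and `gc_fraction` on the same input can be compared with the GC content field of the record.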