Gene Mlg_2501 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_2501 
Symbol 
ID4270820 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp2841443 
End bp2842669 
Gene Length1227 bp 
Protein Length408 aa 
Translation table11 
GC content65% 
IMG OID638127259 
Producthypothetical protein 
Protein accessionYP_743331 
Protein GI114321648 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones45 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCAGCA CATGCCATAC CAAAAAACCG GTTTGCCTGG GGGTGGCCAC CGCCGCACTG 
CTGGGGCTGA TCATCTCACC GGCGTGGGCG GAGACCACCG TCGATTTCGG CGGCTACGTG
AAGATGGATG CCCACTTCAG TGATGTGACC AACCGTGCCG CCAACGACCG CAGCGAGGCC
TTCATGATCC CCGCCCTGAT CCCGCTGGAC GGGCAGGAGG ACTCACGGAA TGTCACCCGC
TACAGCGTGC GTGAGAGTCG GGTCAACCTG CGCACGCAGA CGCCCACCGG CCTGGGGGAT
CTGACCACGT TTCTCGAAGT GGACTTTCTC GAGGATGCCG CCATCGACCG CAACCGCCTG
GTGGGCAACC AGCCCCCGCG GCTACGGCAC GCCTTCGGTC AGCTGGGCAA CTGGCTGGCC
GGGCAGACCT GGGGCACTTT CTACAACGTT ACCACCAAAC CGGAGACCCT GGATTTCGTC
GGGCCCGCCG GCACGGTGTT CAATCGCAAT ATCCAGGTGC GCTACACCCT GCCGTTGGAG
CAGGGCAACA GCCTGATGCT GGCGGTGGAA CAGCCCTTCA CCACCCTGGC CTCCGAGGCG
ACGTTGGGTG AGGCCGATCC GGGCGATGCC ATCCGCAACG CCCGGGATGA CCGCTGGCCG
GAGTTCGTCG CCCGCTACAA CGTCTCGGGG GATTGGGGCC ACGGCTCCCT GGCTGGCGTC
GCCCGGAATC TCCGGGTCGA TCGCAGCACC TCGCGGGAAC TCGGCGCCGA CGTGGACGAC
GACGAGTGGG TGGGCGCACT CAGCCTGACC GGCGTGGTGA AGGCCGGAGG GCGCAATGAT
GTCCGCTTTC AGCTCAACTA CGGCGACGGA CTCGGCCGTT ACCTCGGCCT GAACGCCTTT
CCCGATGCCT TTATCGACGA CCAGGGCAAT CTGGACTCCT TGAGCATCTG GGGTGGGTAT
GTGTCCTATC GGCACTGGTG GAATCAGACC CTGCGCAGCA GCCTGGTCTA CAGCCTGGCC
AAGGCCGACA ACCCCAGCAG CGCCCCGGAG ACGGCCAACG AGCAGATCCA GTCGGTGCAC
CTCAACCTGA TCTATACACC GGTGCAGAAC GTGGACGTGG GCGTGGAGTA CATCTGGGCC
GAGCGGGAGA TCGAGGGCGA GGACGCCTAT GGTGAGGACA GCGGCGAGCT GAACCGCGTG
CAGGTCTCGG CGAAGTACAG CTTCTGA
 
Protein sequence
MRSTCHTKKP VCLGVATAAL LGLIISPAWA ETTVDFGGYV KMDAHFSDVT NRAANDRSEA 
FMIPALIPLD GQEDSRNVTR YSVRESRVNL RTQTPTGLGD LTTFLEVDFL EDAAIDRNRL
VGNQPPRLRH AFGQLGNWLA GQTWGTFYNV TTKPETLDFV GPAGTVFNRN IQVRYTLPLE
QGNSLMLAVE QPFTTLASEA TLGEADPGDA IRNARDDRWP EFVARYNVSG DWGHGSLAGV
ARNLRVDRST SRELGADVDD DEWVGALSLT GVVKAGGRND VRFQLNYGDG LGRYLGLNAF
PDAFIDDQGN LDSLSIWGGY VSYRHWWNQT LRSSLVYSLA KADNPSSAPE TANEQIQSVH
LNLIYTPVQN VDVGVEYIWA EREIEGEDAY GEDSGELNRV QVSAKYSF