Gene Mlg_0843 details

Gene Information

Locus tag: Mlg_0843
Symbol:
ID: 4270780
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 958467
End bp: 959801
Gene Length: 1335 bp
Protein Length: 444 aa
Translation table: 11
GC content: 64%
IMG OID: 638125595
Product: hypothetical protein
Protein accession: YP_741687
Protein GI: 114320004
COG category:
COG ID:
TIGRFAM ID:
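
The coordinates and lengths listed above are consistent with each other, which can be checked with a little arithmetic. The following is a minimal Python sketch, assuming that Start bp and End bp are 1-based inclusive positions and that the final codon of the CDS is a stop codon not counted in the protein length. The function names are hypothetical and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming its last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(958467, 959801)
print(length)                  # 1335
print(protein_length(length))  # 444
```

Both results match the Gene Length and Protein Length fields of this record.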


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 53
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCAGAT CAACCATCGA CGATACGTTA CCCCCTATGC GCGCCGGCGA GCGCGAAAGC 
CGCGCCGGCG GCGAGCAGTA CTGCCTGAGC CCTCTGCCCC TGCCCATCGA CAGCGAACAC
GGCGTGGACG TGGCTCACAT CGACGTGCGC CACGACGAGG TCACGATGGA CCTGCGCCAG
GGCGCCACCC CCGAGAGTCT CCTGGTTACG GGGCATATTG CCAGTGGGCT CATGACTTTT
TTGGCAGGAC TGGCCCTCGT TTTCTTGCTG AGTGCCGTGT ACAAGGGATC CCTTCCCTTC
GAACTGGTAG TAGCAGGAGT GGGGGTTACT TACGGAATTG CGCTGTTTCC CTTCTTCATC
GGCATTCTCT ATCCCGATGT CATACTGAGG CGCATCCCAC CCATTCGCCT GCATCGCCAA
CGCCGCGAGG TGGCCTTCGT GGTGGAGAGC CGGGGACAGA GCATTTGGTT ACCCGACCCA
ACTAACATGT GGTTGGCGTC CACGGCCGGC ACCATTGCCA TGCTCACCGG ATTTTTTGCA
ATATGGGATT CTGTGGAGTT TTTCCGCCCT GACGTTGAGG CGGCATTTCC TCTCATGATG
CTGCTCCTGC ATATCGCCTC CCTGGCCTTC CTGCCTCTCT ATCCCTATTT CCATGACTTC
TGCCGCAAGC TCGTCGGCCA ACAACGCCAG ACGGTGCTGG TCCCCTGGGA GGACGTGGTC
GCCCTGGCGG TCTTCAACCC AAACCTCAGC GCCGGGGCGG TCACCGGCTT CGGCTGGAAC
TTCGCTCTGA TGCCGCCCGA TCCGGAACGA CCCGGCTATA CCCTGCCCGG GGCCGGCATC
ATCGTCGGGG TCGGCGGCCT CCCGGGCGCG CTGGCCCAGT GGGAGTACAT CCGCCGCTTC
ATGGAGGAGG GGCCCGAGGC CATCACCTCT TCCGCCCGGG AGTGGGGCCT CGAGTGGTAC
GACGCCTACG TGGCCCGGGA AAAGGCCGAG TGCGAACGCA CCCACGACAA GGCCCGCTGG
CGGCGCTTTC GGCGCGAGCA GTGGTGGAAT CATGCACGCT TCGCCCACTG GTACACCGAG
TACCGCATGA AGCATGTGCT GCCCAAGGCG GTGCCCAAGG GCTGGCTGGC GGGGTGGTCC
AGGCCCCTTC CCCGGGACCA GTGGGCCCGA CCTTCGCGCG AACTCACTGA ACTCGGCGAA
CAGCTACGCG CGGCCTACCA GCGCGGCGAG AAGTTCATCA AGATGGGCAA TATCGAGAAA
CGCTTTGGGG TCGAGGTAGA GCCCTCCCCA GGTACGGCTT ATCGCACACT GCCATTTGCG
GCCAGCGCGG CCTGA
 
Protein sequence
MPRSTIDDTL PPMRAGERES RAGGEQYCLS PLPLPIDSEH GVDVAHIDVR HDEVTMDLRQ 
GATPESLLVT GHIASGLMTF LAGLALVFLL SAVYKGSLPF ELVVAGVGVT YGIALFPFFI
GILYPDVILR RIPPIRLHRQ RREVAFVVES RGQSIWLPDP TNMWLASTAG TIAMLTGFFA
IWDSVEFFRP DVEAAFPLMM LLLHIASLAF LPLYPYFHDF CRKLVGQQRQ TVLVPWEDVV
ALAVFNPNLS AGAVTGFGWN FALMPPDPER PGYTLPGAGI IVGVGGLPGA LAQWEYIRRF
MEEGPEAITS SAREWGLEWY DAYVAREKAE CERTHDKARW RRFRREQWWN HARFAHWYTE
YRMKHVLPKA VPKGWLAGWS RPLPRDQWAR PSRELTELGE QLRAAYQRGE KFIKMGNIEK
RFGVEVEPSP GTAYRTLPFA ASAA
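
The protein sequence corresponds to the gene sequence read codon by codon under translation table 11. The following is a minimal Python sketch, assuming the NCBI amino-acid assignment for table 11 (the same as the standard code for sense and stop codons) and treating the spaces between 10-base blocks as formatting only. The helper names `clean`, `translate`, and `gc_percent` are hypothetical. Only the first 30 and last 15 bases of the gene sequence are used here to keep the example short.

```python
from itertools import product

# Codons enumerated in NCBI order (T, C, A, G), paired with the
# amino-acid string for translation table 11; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove whitespace (block spacing, line breaks) and upper-case."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a DNA string in frame 1, ignoring trailing partial codons."""
    dna = clean(seq)
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    dna = clean(seq)
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


head = "ATGCCCAGAT CAACCATCGA CGATACGTTA"  # first 30 bases of the gene
tail = "GCCAGCGCGG CCTGA"                  # last 15 bases of the gene

print(translate(head))  # MPRSTIDDTL
print(translate(tail))  # ASAA*
```

The two fragments reproduce the start (MPRSTIDDTL) and end (ASAA, followed by the stop codon) of the protein sequence. Passing the full gene sequence to `gc_percent` gives a value that can be compared against the GC content field of this record.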