Gene Mlg_1066 details


Gene Information

Locus tag: Mlg_1066
Symbol:
ID: 4268987
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 1245037
End bp: 1246428
Gene Length: 1392 bp
Protein Length: 463 aa
Translation table: 11
GC content: 65%
IMG OID: 638125817
Product: hypothetical protein
Protein accession: YP_741908
Protein GI: 114320225
COG category:
COG ID:
TIGRFAM ID:
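
The reported lengths can be cross-checked against the coordinates in this record. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information record.
START_BP = 1245037
END_BP = 1246428
REPORTED_GENE_LENGTH_BP = 1392
REPORTED_PROTEIN_LENGTH_AA = 463


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Length in aa, assuming the final codon is a stop codon that is not counted."""
    return gene_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)

print(length_bp == REPORTED_GENE_LENGTH_BP)    # coordinates agree with Gene Length
print(length_aa == REPORTED_PROTEIN_LENGTH_AA)  # gene length agrees with Protein Length
```

Under these assumptions both checks agree with the record: 1246428 - 1245037 + 1 = 1392 bp, and 1392 / 3 - 1 = 463 aa.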


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 40
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGTCA CCATCCGCGA CGTGATGACC GATCCCGCCC TGTTCGGTGG CCAGTTCGGT 
GGCGACACCT GGGCCGCCTG GCGTGCGCTC CTGAGCGGCT TTTATGGCCT CCCGCTGGAC
GATGCCGAGG CACAGCACTG GCACGCGCTC ACAGACCGCG AGAGCGCCCC GCAGAGCGCA
CATGACGAGT TGTGGCTAGT GGTAGGCCGC CGCGGTGGCA AGTCCAATGC AGCGGCCTTG
CTGGCGGTCT ATGAGGCGTG TTTCAAAGAC CACCGCGATG CCCTGGCACC CGGTGAGGTT
GCCACCACCC GCGTCATGGC TGCCGACCGT GCGCAGGCCC GCAGCGTGTT CCGGTATATC
TCCGGTCTGA TGCACGCGAA TCCGATGCTG GAACGGCTGA TCGTGCGCGA GGATCGGGAA
TCCATAGAGC TGTCCAACCG GGCTGTTATC GAGGTGGGCA CGGCCTCATT CCGCACGACA
CGCGGCTACA CGTTCGCGGC GGTGATTGCC GACGAGGTGG CGTTCTGGCG CTCCGATGAC
AGCGCGAACC CTGACAGCGA GATCATTGCC GCCGTGCGTC CCGGTCTGGC CACGCTGAAC
GGCAAGCTGA TCGCGCTTTC CAGCCCATAC GCCCGACGCG GTGAGCTATG GGAAAACTAC
CGCCGACACT ACGGCAAGGC ATCGCCCATC CTGGTGGCGC AGGCTCCCAG CCGCACCATG
AATCCCTCAT TGCCTGAGCG CGTGGTCACG GAGGCAATGG AGCGTGACCC GGCCAGTGCG
GCGGCGGAGT ACCTGGCGGA GTTCAGGACG GACGTGGAGA CCTTCCTGCA ACGCGAAGTA
GTAGAGGCCG CCACGCGGCC CACCCCGCTG GAGTTGCCCT ACAACAAGCG CGTTACCTAT
ACCGCCTTTG TTGATCCGGC AGGTGGTGGC GCGGATGAGT TCACCGCCGC CATCGGCCAC
CGGGAAGGGG AGCGCGTGGT CGTGGACGTG CTACGCGCCC GCAAGGGTAC GCCTGCCGAG
ATCGTTGCCG AATACGCCGA CCTGCTGAAG TCCTACCGGA TCACCCGCGC TATCTCGGAT
CGTTATGCAG GCTCATGGCC TGCCGACGAG TTCAGCCGCC ACGGGATCAC CGTAGAGCAG
GCCGCTAAAC CGAAGTCAGA CCTTTATCGG GACATGCTCG CCAGCATGAA CAGCGCCCGC
GTGGAGCTTC CGCCCGATGA TCGGCTGATG ACCCAGCTAA TCAGCCTGGA GCGCCGCACA
GCACGCGGTG GCCGGGACAG TATCGACCAC GCCCCCGGTG GTCACGATGA CAGAGCAAAC
GCCGTTGCCG GTCTGGTGGC GGCCAACTCA CGCGCCCCAG GCGAACGGAT GCGGGCGCTT
TGCACTTGGT AG
 
Protein sequence
MSVTIRDVMT DPALFGGQFG GDTWAAWRAL LSGFYGLPLD DAEAQHWHAL TDRESAPQSA 
HDELWLVVGR RGGKSNAAAL LAVYEACFKD HRDALAPGEV ATTRVMAADR AQARSVFRYI
SGLMHANPML ERLIVREDRE SIELSNRAVI EVGTASFRTT RGYTFAAVIA DEVAFWRSDD
SANPDSEIIA AVRPGLATLN GKLIALSSPY ARRGELWENY RRHYGKASPI LVAQAPSRTM
NPSLPERVVT EAMERDPASA AAEYLAEFRT DVETFLQREV VEAATRPTPL ELPYNKRVTY
TAFVDPAGGG ADEFTAAIGH REGERVVVDV LRARKGTPAE IVAEYADLLK SYRITRAISD
RYAGSWPADE FSRHGITVEQ AAKPKSDLYR DMLASMNSAR VELPPDDRLM TQLISLERRT
ARGGRDSIDH APGGHDDRAN AVAGLVAANS RAPGERMRAL CTW
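
The relationship between the two sequences above can be sketched in code. This is a minimal illustration, not the database's own pipeline: it assumes the gene sequence is given in the coding orientation, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons). The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon table built in TCAG order; the codon-to-amino-acid assignments
# used here are shared by the standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace (the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base groups of the gene sequence.
print(translate("ATGAGCGTCA CCATCCGCGA CGTGATGACC"))  # MSVTIRDVMT
# Final line of the gene sequence, ending in the TAG stop codon.
print(translate("TGCACTTGGT AG"))  # CTW
```

The two outputs match the first ten and last three residues of the protein sequence in the record.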