Gene Mlg_0048 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_0048 
Symbol 
ID4270917 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp51556 
End bp53319 
Gene Length1764 bp 
Protein Length587 aa 
Translation table11 
GC content66% 
IMG OID638124773 
Producthypothetical protein 
Protein accessionYP_740895 
Protein GI114319212 
COG category[S] Function unknown 
COG ID[COG3519] Uncharacterized protein conserved in bacteria 
TIGRFAM ID[TIGR03359] type VI secretion protein, VC_A0110 family 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones59 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGAACC GCTACTACCG GGACGAACTC AACTTCCTTC GCCAGGAGGG CAGGGCGTTC 
GCTCAGGCGT ATCCGCACCT CAGCCGTTTC CTCTCGGAGC CGGGGGACGA CCCGGACGTG
GAGCGGTTGC TGGAGGGCTT CGCCTTCCTC ACCGGACGCA TGCGCGAGAA GGTGGAGGAT
GAGTTCCCGG AGCTCACCCA CTCGCTCATT AGCATGCTCT GGCCCAACTA TCTGCGTCCG
GTCCCGAGCA TGACCATTGT CCGTTTCGAT CCCCGGTGGC ATGCGTTGCG TGCCGGGCAC
CGGCTGCCGC GCGGCACCGC GCTGCGGAGC CAGCCGGTAC AGGGTACCCC CTGCCGATTC
CGGACCAGTC ACGACGTCAC CCTGTATCCG TTGGAGGTGG CCGGTGTTGA CACCGCCCGT
TCACGCAGCC GGTCCCAGGT GACGTTACGG CTTGCCGTGC ACAGTGATCA GCCGCTCGCG
GACCTGCCGG CCGATCCGCT GCGTTTCTAT CTTGGCGGTG ACGGCTATAC GGCGCGCACG
CTCTATCTCT GGCTGCAGCA TTATCTGGAG GGCGTGGATC TGGAGGTGGC CGGCGAGCGT
CGCAGCCTGC CGGCGGACGC TATCAGTCCG GTCGGCTTCG AGCGTGATCA GTCCCTGCTG
CCCTACCCGC GCAATAGTTT TCAAGGGTAT CGCATCCTCC AGGAATACCT CTGTGTCCCG
GACGCATTCC GTTTCCTTGA CTTGCAGCGC TTGTCTGCCG CCCTGCCCCA CGAGGCGGCC
GACGAGATCC GGCTGGTGTT CCGTTTCTCA CGCACCCTGC CGAGGGATGC CCGCCTGTCG
GTGGACCACT TCCAGCTCCA TTGCACCCCG GCGGTCAACC TCTTCGAGCA GGACGCCGAC
CCGATTGACC TGACCGGCGA GCGTGCCGAG TACCCGATCC TGCCCAGTAG CCGTAACCCC
GCCCACTACG AGGTCTATAG TGTCGATGCG GTGGAAGGGT GGCTCACCAC GGGCAGCGGC
CGGTTCCGCG GCGAGCCCCG CCGTTATGTG CCCTTCGAGA GCTTCCAGCA CCAGCTCGAG
CGCGACCGTG GAGGGGATGC GCGCTACTAC CGGCTCCGCG TACGGGAGAG CGTGCGCGAC
GACGGCTTCG CCCACGATAT CGCCTTCGTA CGGGAGGACG AGGTCTACCG GTTGGCGCAC
CATGAGACCG TCTCCTTGCG CCTGACCTGC ACCAACCGGC GGTTGCCCGA ATCCCTGGGC
GTGGGTGACA TCACCGACTT TGCCGACGAC AGCCCCGCCC TGGTCACGGC CCGCAACATC
ACCCGGCCCA CCCCTGCCCT CAGGCCGCAA CTGGACGGCG GCCTGCTGTG GACGCTGATC
TCCAATCTCG CCCTGAACTA TCTCTCGCTG TTGCATACCG ACGCGCTCCG TTCGGTCCTG
CGGGCCTACG ATTTTCGCGC GTTGGTGGAC CGTCAGGCCG AGCGCGCCTC GCAGCAACGC
CTGGCGGGCA TACGGGCCAT CGACACCGTG CCGGTGGATC GTTTGCACCA CGGCCTGCCG
GTGCGCGGGA TGCGCTCGGT GGTAACGTTG GACGAAGCCG CCTTCGGTGA CGAGGGCGGG
CTCTACCAGT TCGGCTGCGT GCTGGCGCGC TTTCTGGCGC TGTACGCCAG CATCAACGCC
TTTCACGAGC TGCAGGTCGT CAATCTCAGA AACCAGGAGC GCTACACATG GAAGTGGCAG
CCCGGTCAGC AACCGCTGAT GTGA
 
Protein sequence
MLNRYYRDEL NFLRQEGRAF AQAYPHLSRF LSEPGDDPDV ERLLEGFAFL TGRMREKVED 
EFPELTHSLI SMLWPNYLRP VPSMTIVRFD PRWHALRAGH RLPRGTALRS QPVQGTPCRF
RTSHDVTLYP LEVAGVDTAR SRSRSQVTLR LAVHSDQPLA DLPADPLRFY LGGDGYTART
LYLWLQHYLE GVDLEVAGER RSLPADAISP VGFERDQSLL PYPRNSFQGY RILQEYLCVP
DAFRFLDLQR LSAALPHEAA DEIRLVFRFS RTLPRDARLS VDHFQLHCTP AVNLFEQDAD
PIDLTGERAE YPILPSSRNP AHYEVYSVDA VEGWLTTGSG RFRGEPRRYV PFESFQHQLE
RDRGGDARYY RLRVRESVRD DGFAHDIAFV REDEVYRLAH HETVSLRLTC TNRRLPESLG
VGDITDFADD SPALVTARNI TRPTPALRPQ LDGGLLWTLI SNLALNYLSL LHTDALRSVL
RAYDFRALVD RQAERASQQR LAGIRAIDTV PVDRLHHGLP VRGMRSVVTL DEAAFGDEGG
LYQFGCVLAR FLALYASINA FHELQVVNLR NQERYTWKWQ PGQQPLM