Gene Mlg_0052 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_0052 
Symbol 
ID4270921 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp55586 
End bp56923 
Gene Length1338 bp 
Protein Length445 aa 
Translation table11 
GC content67% 
IMG OID638124777 
Producthypothetical protein 
Protein accessionYP_740899 
Protein GI114319216 
COG category[S] Function unknown 
COG ID[COG3522] Uncharacterized protein conserved in bacteria 
TIGRFAM ID[TIGR03353] type VI secretion protein, VC_A0114 family 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.808473 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones59 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTCTGCC GTAACCGAGT CGTTTGGCGT GAAGGGGCGT TCATCAAACC GCACCACTTC 
CAGCAACAAC AGCGCAGCCT CGAGGGGCTG CTGGACCTGC GGCTGCAGGC GGTCAGCGGC
TACAGCCACG GCTTCCTGCA ACTGGAACTG AACAGCGAAT TCCTGGGCTT CGGCCGGATT
GCGCTCACCC GGGCCCGCGG CATCATGCCG GACGGCACCG CCTTCGACCT GCCGGGCGAT
GACCTGGAGC CACCACCACT GGCCGTGGAC GAGGCCGGCA TGGCCAACCA GCGGGTCTAC
CTGGGGCTGC CCCTGGCCGG TGATGGCGTG GCCGAGGTCA GCGACGAGGA CGCCATCAGG
GATGATGGCC GCTACCGCCT GCATCGCCGG GAGATCTGGG ATCTTCACAC CAGTCCTGGA
GATGTCGCTG AACTGGCGGT GGCCCGCGCC GCCCCGCGCC TGCTGTTGGA GCACGATGAC
CGCAGCGGTT ATGCCTGCCT GGCAGTGGCG CGGATACTGG AGCGGCGCCC GGATGGTTCA
CTGGTACTCG ATCCCGACTT TATCCCCACC ACCCTGACCA CCCGGGTGGC GCCCGGCCTG
CAGCGGTTCA TTGGTGAGGT GGCCGGGTTG ATGCAGGCGC GAGCGCGTCG GATCGCACAG
CGGCTGGCCG CTCCGCAGCA GGCCGGGGTG GCTGATGTCT CCGACTTCAT GCTGCTGCAA
TTGCTGAACC GCCTGCAGCC CCGGTTCCAG CACCTGCAAC AGCACCGGCG GCTGCACCCG
GAGGCCCTCT ACAGCCACAT GCTGGAGGCC TGCAGCGAAC TGGCGACCTT CACCGACGAG
TCGCGGTTGC CCCGGCGCTA TCCTCCCTAC GACCACGATG CCCCCGATAC CGCCTTCCGT
GCGCTCATGC AGGGGCTCCG TCAGGCTCTC TCCACCGTGC TGGAGGCCCG AGCGGTGGCC
ATTCCCCTGG AGGCCCGCCG TCACGGGCTC ATGCTCGCGC CGCTAAGCGA TTCGACGCTG
CTGGACGAGG CCGAGTTCGT GGTCGCCGTG CGCGCGGACA TGGCGGTGGA GACGCTGCGC
CGACAGTTCA TCCAGCAGAC GAAGATCGCC GGTATCGAAC GCATCCGCGA CCTGGTCAGT
CTGCAACTGC CCGGCATTCC GCTCGTTCCG CTGCCGGTCG CGCCGCGCCA GCTCCCCTAT
CACGCGAGTC ATATCTACTT CCAGCTCGAC CGTCGCAGCG AGGCCTGGGG CCTGCTGACC
GGTGCGAGCG GTTTCGCCTT CCACCTGGGT GGTGACTTCC CAGGGCTGGA TCTCCAGTTC
TGGGCAATAA GGAGTTGA
 
Protein sequence
MVCRNRVVWR EGAFIKPHHF QQQQRSLEGL LDLRLQAVSG YSHGFLQLEL NSEFLGFGRI 
ALTRARGIMP DGTAFDLPGD DLEPPPLAVD EAGMANQRVY LGLPLAGDGV AEVSDEDAIR
DDGRYRLHRR EIWDLHTSPG DVAELAVARA APRLLLEHDD RSGYACLAVA RILERRPDGS
LVLDPDFIPT TLTTRVAPGL QRFIGEVAGL MQARARRIAQ RLAAPQQAGV ADVSDFMLLQ
LLNRLQPRFQ HLQQHRRLHP EALYSHMLEA CSELATFTDE SRLPRRYPPY DHDAPDTAFR
ALMQGLRQAL STVLEARAVA IPLEARRHGL MLAPLSDSTL LDEAEFVVAV RADMAVETLR
RQFIQQTKIA GIERIRDLVS LQLPGIPLVP LPVAPRQLPY HASHIYFQLD RRSEAWGLLT
GASGFAFHLG GDFPGLDLQF WAIRS