Gene Mlg_0693 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_0693 
Symbol 
ID4268852 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp770997 
End bp772373 
Gene Length1377 bp 
Protein Length458 aa 
Translation table11 
GC content72% 
IMG OID638125442 
Productexodeoxyribonuclease VII large subunit 
Protein accessionYP_741537 
Protein GI114319854 
COG category[L] Replication, recombination and repair 
COG ID[COG1570] Exonuclease VII, large subunit 
TIGRFAM ID[TIGR00237] exodeoxyribonuclease VII, large subunit 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.183752 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value0.0687889 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTTACCA CCGCATCCCA GGCCCCGCGC ACGGTCTACA CCGTCAGCCA GCTCAACCAG 
GAGGTCCGCA GCCTGCTGGA GACGACCCTG CCGCCGCTCT GGGTGGAGGG CGAGATCAGC
AACCTGGCGC GGCCGCGCTC GGGCCATCTC TATTTCACCC TCAAAGACAG CGCCGCCCAG
GTGCGCTGCG CCATGTTCCG CAACCGCAAC CTGCTGCTGC GCTTCCAGCC CGGCGACGGT
CAGCGGGTGC TGGTGAGGGC CCGCGCCGGG CTCTACCCCG CCCGCGGCGA GTTCCAGCTG
GTGGTGGACC ACATGGAGGA GGCCGGCGAG GGGGCCCTGC GGCGGGCCTT CGAGGCCCTG
AAGGCCCGGC TGGAGCAGGA GGGGCTGTTC GATCCGGCCC ACAAGCGGCC GCTTCCCGCC
TTCCCCCGCC GGTTGGGGGT AATCACCTCG CCCACCGGGG CCGCCATCCG CGACGTGCTG
ACGGTGCTCC GGCGCCGCTT TCCGGCCCTG CCGGTGCTGA TCTACCCGGT GCCGGTGCAG
GGAGAGGGGG CCGGCGGACA GATCGCCGCC GCCATTGAAG AGGCCGACCG CCGTCGCGAC
GTGGACGTAG TGCTGGTCAC CCGGGGCGGT GGCTCGCTGG AGGATCTATG GGCCTTCAAC
GAGGAGGTGG TGGCGCGGGC CATCCACGCC TGCGGCCTGC CGGTGGTCAG TGCCGTGGGG
CACGAGGTGG ATGTCACCAT CTCCGACCTG GTGGCCGACC AGCGCGCACC CACGCCGTCG
GCCGCCGCCG AACTCATCTC GCCGGACGGC CCGGCGCTGC TGCACCAAGT CCGGGGTCTG
CGGGACCGAC TGCTGCAGCT GACCACCCAG CACCACCGGC GGGCCAGTGA TCGGCTGAAC
GGCCTCGCCC GCCGGCTGCA GGCCCGGCAC CCGGGCCAGC TTCTGCGCGA CCGCAGTCAG
CGCCTGGACG AGCTGGACCA ACGCCTGCGC CATGCTATGG CGCAGCGCCT TGCCCGGCAC
ACGCAGGCCC TGGACCATTT GCGCGCCCGG TTACGCCAGG GTGATCCGCG GCTCACCATC
CGCCGGCGCG AGGAACAGCG CCTGGCGTTG GAGCGACGGC TGCATGCGGC CGTACGCCAG
CAGCTACAGG CCCGGGAGCA GCGGCTGTCC GGGCTCGGTC GTGCGTTGCA TGCGGTCAGT
CCGCTGGCCA CCCTCTCCCG CGGCTACGCC ATCGCCCGCC AGGGCGCTGA CGGGCCGGTG
CTGCGCGACA GCACCCAAGT GGCACCCGGT GATGCCGTCC GAGTCCGGTT ACACCGGGGG
CAGCTGGACT GCCGGGTGGA GCGGGTCCAC GGCGAACCGG AGGGTGGCAA ACAGTGA
 
Protein sequence
MVTTASQAPR TVYTVSQLNQ EVRSLLETTL PPLWVEGEIS NLARPRSGHL YFTLKDSAAQ 
VRCAMFRNRN LLLRFQPGDG QRVLVRARAG LYPARGEFQL VVDHMEEAGE GALRRAFEAL
KARLEQEGLF DPAHKRPLPA FPRRLGVITS PTGAAIRDVL TVLRRRFPAL PVLIYPVPVQ
GEGAGGQIAA AIEEADRRRD VDVVLVTRGG GSLEDLWAFN EEVVARAIHA CGLPVVSAVG
HEVDVTISDL VADQRAPTPS AAAELISPDG PALLHQVRGL RDRLLQLTTQ HHRRASDRLN
GLARRLQARH PGQLLRDRSQ RLDELDQRLR HAMAQRLARH TQALDHLRAR LRQGDPRLTI
RRREEQRLAL ERRLHAAVRQ QLQAREQRLS GLGRALHAVS PLATLSRGYA IARQGADGPV
LRDSTQVAPG DAVRVRLHRG QLDCRVERVH GEPEGGKQ