Gene Ava_0856 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_0856 
SymbolmutL 
ID3681757 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp1046067 
End bp1047758 
Gene Length1692 bp 
Protein Length563 aa 
Translation table11 
GC content47% 
IMG OID637716190 
ProductDNA mismatch repair protein 
Protein accessionYP_321375 
Protein GI75907079 
COG category[L] Replication, recombination and repair 
COG ID[COG0323] DNA mismatch repair enzyme (predicted ATPase) 
TIGRFAM ID[TIGR00585] DNA mismatch repair protein MutL 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.127001 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCATCTA CTATTCAAGC TTTACCTCAA GAAGTCGTGT ACCTCATTAC AGCAGGGGAG 
GTCATCGACT CATTTGCTGC TGTAGTGCGG GAATTGGTAG AAAATTCCTT GGATGCAGGT
GCAAACAGAA TTGTAGTTTA TTTATGGCCG CAGCAATGGC GTGTTCGTGT GGCAGATAAT
GGTTGCGGGA TGAACTTAGA TGATTTGCAA CAAGCCGCTT CAGCCCACAG CACCAGTAAA
ATTCGTTCTA GTGCAGATTT GTGGAAAATT CATAGTTTGG GATTTCGGGG TGAAGCACTA
CATAGTTTAA CGACACTAGC AGATGTAGAA ATTATCAGTC GCGCCCCCGA AAATGTGGGC
TGGCGAGTGG TTTATGGTGA TAATGGGGAA GCAACTCAAG TTGAAGCGAC AGCGATCGCT
CCTGGTACAG TAGTCACAGT CTCGAATTTA TTCGCCAGTT GTGCGGCTCG TCGTCAAGGC
TTACCCACAA CAGCACAGCA AATGAAAGCT GTGCAAGCCA CAATTCAACA AATCGCCCTT
TGTCATCCCC AAACCACCTG GCAAGTTTGG CAAAATGACC GGATTTGGTT CACCATCTCC
CCCGCCGCCA CAGCCGGACA ACTCATACCC CAATTCCTCC CCCAACTACG CCCCGGTGAT
TTACAAGAAA TTAAGCTAGA GATACCCAAC CCAGAAAACC CACAACTCAG CACCAACAAC
AAGGCAAACG CCACAACACT TTCCCTAGTT GTAGGATTGC CAGACCGTTG TCACCGCCAT
CGTCCAGATT GGGTGCGGGT AGCAATTAAC GGACGGATGA TTAAGTCGCC AGAATTAGAA
CAAACAATTT TGGCAGCATT CCACAGAACA TTACCACGCG ATCGCTACCC CCTTTGTTTT
CTCCATCTTC TGATTTCCCC CGACCAAATC AACTGGAACC GCAACCCAGC CAAAACAGAA
ATTTACCTCC ACGATTTGAG TTATTGGCAA GAACAAGTAA CTCAAGCCAT TAACCAAACA
CTACGGATTA GTGCTGCCAA TATAAAAGAA TCTGTCCAGA CAACGCGAGT GAGTCAACTA
CTCAAAGCCG CCGAGGAAAA AGGGAACTAT AACTTTAATC CTCAAAACGC CAATCCAGCA
GACAACACTC AGCACTACTT AAAAGCAGTA GCCCAAGTCA GTAACACTTA TATAGTCGCA
GAACATTCTG GGGGAATGTG GCTAGTAGAA CAGCACATTG CCCATGAGCG AGTTTTATAC
GAACAATTGT GTGATAACTG GCGACTTATT CCCGTTGAAC CACCAATTAT TCTTTATCAG
CTATCCCCAG CCCAAGTTGC TCAACTGCAA CGCATCGGTT TAGATATTGA CATCTTTGGC
GAACAACTTT GGGCAGTGCG TAACCTGCCA GCAATGTTAC AACAACGTGA AGACTGTGCC
GAAGCAATTT TAGAACTCAG TTGGGGAGGT GACTTACAAA CAGCCCAAGT AGCCGTCGCC
TGTCGCAGTG CTATTCGCAA CGGTACTCCC ATGAGTCTAC CAGAAATGCA GAAGTTACTA
GACGATTGGC AACGTACTCG CAACCCCCGC ACCTGTCCCC ACGGTCGGCC GATTTATTTG
TCCTTAGATG AGTCTTCCTT ATCTCGGTTT TTCCGTCGTC ATTGGGTGAT TGGCAAAAGT
CACGGTATAT AG
 
Protein sequence
MASTIQALPQ EVVYLITAGE VIDSFAAVVR ELVENSLDAG ANRIVVYLWP QQWRVRVADN 
GCGMNLDDLQ QAASAHSTSK IRSSADLWKI HSLGFRGEAL HSLTTLADVE IISRAPENVG
WRVVYGDNGE ATQVEATAIA PGTVVTVSNL FASCAARRQG LPTTAQQMKA VQATIQQIAL
CHPQTTWQVW QNDRIWFTIS PAATAGQLIP QFLPQLRPGD LQEIKLEIPN PENPQLSTNN
KANATTLSLV VGLPDRCHRH RPDWVRVAIN GRMIKSPELE QTILAAFHRT LPRDRYPLCF
LHLLISPDQI NWNRNPAKTE IYLHDLSYWQ EQVTQAINQT LRISAANIKE SVQTTRVSQL
LKAAEEKGNY NFNPQNANPA DNTQHYLKAV AQVSNTYIVA EHSGGMWLVE QHIAHERVLY
EQLCDNWRLI PVEPPIILYQ LSPAQVAQLQ RIGLDIDIFG EQLWAVRNLP AMLQQREDCA
EAILELSWGG DLQTAQVAVA CRSAIRNGTP MSLPEMQKLL DDWQRTRNPR TCPHGRPIYL
SLDESSLSRF FRRHWVIGKS HGI