Gene Mbur_1913 details


Gene Information

Locus tag: Mbur_1913
Symbol:
ID: 3997705
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methanococcoides burtonii DSM 6242
Kingdom: Archaea
Replicon accession: NC_007955
Strand:
Start bp: 2004608
End bp: 2005624
Gene Length: 1017 bp
Protein Length: 338 aa
Translation table: 11
GC content: 44%
IMG OID: 637959656
Product: flap endonuclease-1
Protein accession: YP_566545
Protein GI: 91773853
COG category: [L] Replication, recombination and repair
COG ID: [COG0258] 5'-3' exonuclease (including N-terminal domain of PolI)
TIGRFAM ID: [TIGR03674] flap structure-specific endonuclease
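
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the helper names `span_length` and `expected_protein_length` are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2004608
end_bp = 2005624
gene_length_bp = 1017
protein_length_aa = 338


def span_length(start, end):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by an unspliced CDS that ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # subtract the terminal stop codon


coords_ok = span_length(start_bp, end_bp) == gene_length_bp
protein_ok = expected_protein_length(gene_length_bp) == protein_length_aa
print(coords_ok, protein_ok)  # True True
```

Both checks hold for this record: the coordinate span is 1017 bp, and 1017 / 3 - 1 gives 338 residues.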


Plasmid Coverage information

Num covering plasmid clones: 47
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGTACGG ATATTGGTGA TCTACTTCTT AAGGATACGA TCGAGATAGC TGGCCTTTCA 
AATAAAGTAG TGGCTATCGA TGCGTATAAT ACTCTTTATC AGTTCTTAAG TATAATCCGG
CAACGTGACG GAACCCCTTT AAAGGATTCC AGGGGTCAGA TCACTTCTCA TCTTTCGGGT
ATCCTTTATA GGCTTACCAG TCTCATCGAA GCAGGTGTCA AACCTATTTT TGTCTTTGAT
GGCAAGCCTC CTGATTTCAA ATCTGACACT CTGGCAAAAC GGCATGAGGT CCGGGAAAGT
GCAACTGCTA AATGGGAAGA TGCAAAAGCG CAGGGGCTTG AGGAAGAAGC CTACAAGTAT
GCACAGGCCT CCTCAAAAGT GACCCGTGAG ATGATCGATG ATTCTGTCAG ACTATTGGAA
TTGATGGGTA TCCCTTATGT GAAAGCACCC TCTGAGGGAG AGGCACAGGC CTCATACATG
GTGCAAAAAG GGGATGCTGA TTATATCGGT TCACAGGACT ATGATTCTTT TCTTTTCGGT
GCACCACAGG TTGTTCGAAA TCTCACTATT ACCGGTAAGC GAAAGCTTCC AAAAAAGAAC
ATCTACGTGG ATGTTAAACC CGAGGTCTTG TCCCTTGTGG ATTCCCTTGG GGAACTTGGC
ATTACAAGAC AGCAATTGAT CGATATTGCC ATGTGTGTGG GCACAGATTA TAATACCGGT
CTCGAGAACA TCGGTCCGAA AAGAGCGCTT AAACTGGTGA AGGAACACGG CGATATAAAA
GTTGTACTCA AAGAACTTGG TAAAGATATC GAAGACCTTG ATGCTAAAAG AGATTTCTTC
ATGAACCCGC CCGTAACAGA CGATTATGAA CTGAAATGGA TCAAGCCTGA TCGTGCCGGG
GTAATTGATC TTCTCTGCAA AAAACATGAT TTTTCAGAGG AGAGGGTCAA TAAAGCACTT
GACCGCCTTG AAGCTAACAT AGGCGGCAGT CAAAGCACTC TTGATCAATG GTTTTAA
 
Protein sequence
MGTDIGDLLL KDTIEIAGLS NKVVAIDAYN TLYQFLSIIR QRDGTPLKDS RGQITSHLSG 
ILYRLTSLIE AGVKPIFVFD GKPPDFKSDT LAKRHEVRES ATAKWEDAKA QGLEEEAYKY
AQASSKVTRE MIDDSVRLLE LMGIPYVKAP SEGEAQASYM VQKGDADYIG SQDYDSFLFG
APQVVRNLTI TGKRKLPKKN IYVDVKPEVL SLVDSLGELG ITRQQLIDIA MCVGTDYNTG
LENIGPKRAL KLVKEHGDIK VVLKELGKDI EDLDAKRDFF MNPPVTDDYE LKWIKPDRAG
VIDLLCKKHD FSEERVNKAL DRLEANIGGS QSTLDQWF
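
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below illustrates this on the first and last 10-base groups of the gene sequence rather than the full record. It assumes the usual NCBI ordering of the codon table (bases in the order T, C, A, G) and relies on translation table 11 assigning the same amino acid to each codon as the standard code (the two tables differ in which start codons are permitted). The names `clean` and `translate` are hypothetical helpers, not functions from an existing library.

```python
from itertools import product

# Codon table in NCBI order: first base varies slowest, third base fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate an in-frame DNA string; stop codons appear as '*'."""
    dna = clean(dna)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# First three 10-base groups of the gene sequence above.
first_block = "ATGGGTACGG ATATTGGTGA TCTACTTCTT"
# Final line of the gene sequence above (in frame, since 960 bp precede it).
last_line = "GACCGCCTTG AAGCTAACAT AGGCGGCAGT CAAAGCACTC TTGATCAATG GTTTTAA"

print(translate(first_block))  # MGTDIGDLLL
print(translate(last_line))    # DRLEANIGGSQSTLDQWF*
```

The two outputs match the start and end of the listed protein sequence, with the trailing `*` marking the TAA stop codon that is not counted in the 338 aa protein length.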