Gene M446_1478 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagM446_1478 
Symbol 
ID6130288 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium sp. 4-46 
KingdomBacteria 
Replicon accessionNC_010511 
Strand
Start bp1632910 
End bp1634499 
Gene Length1590 bp 
Protein Length529 aa 
Translation table11 
GC content61% 
IMG OID641641748 
Productintegrase family protein 
Protein accessionYP_001768417 
Protein GI170739762 
COG category[L] Replication, recombination and repair 
COG ID[COG4974] Site-specific recombinase XerD 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones78 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCCTTC AGATGGCCCG CCCAATGAGG CGTTCAGGCT CCTCCTTCCA CCAGCTGGTT 
CAGCGCATCC CGGCGGACGT AGCCGCGAAG GTGCGCGGTA TGAGGCTCTC CATCCCGATA
GGGGAGGGGG AGGCGCACCT CGTCATCTCG GACAAGGCCA CGGACGTTCG CACGTCACTG
CGCACCCGTG ACCCGGCGGT TGCGAAGGCG CGGCAGGCTG TCGCTGTGGC GTACTTGGAG
AAGGTCTGGC GTTCGGTCCG TGAAGGGCCA CAGAGACTCA CACAGAGGCA AACGGTGGCT
CTGGCTGGTG AGGCTTACAC CGCATTGAAA ACGGCTCTGG AGGACGATCC TGGGGCACCT
GAGCGGTGGC TTCGTGTTGA GGTCGATAAT CTACGGGCAG AGTCCGGTCA GTATGGCCGC
TCGGCGCTGA TGATTGGCTC TGCCGAGGAG AAGCAGGCGA GAAGCATTGA GGAGCGAGTT
GGTGGCTTCT CAGACATGGT TCTCGCTAAG CATGCGCTGA ACATCGACGC AGACAGCAGA
GCGAGACTCA ATCAAGAGCT CCTAGCGGCG CTTCGTCAGG CTGGGCTTGT GTTGATGAGA
CATGCCCAAG GCGACTACAG GCCAGACCCT GACGTGAGTC GCTTTCCTGA TTTCCAGACA
TCATCAAGTA AGGCCACGGC CGTCAGCCTT CTAGACCTAT TTGATGGGTG GGCGAAGGAG
CGGAAACCTT CTCAGAGCAC GGTCGATCAG TGGCGCAAGC ACTGTGAGGC ATTCCTAGCC
TTCATCGGCA AGGACGATGC TGGGCGAGTA ACGAAGGCTG ACGTCGTGGC CTGGAAGGAT
ACGCTCGTGG CAGCGGGAGG CGCACCGAAG ACGATCAACG ACAGTAAGCT CGCAGCCCTG
CGTGTTGCCT TCACCTGGGG CGTTGAGAAC GTTCGTGTCA GGTCCAACCC TGCGACAGGC
GTGGCTGTGC GTCAGAAGAT GCAGGCCGGC GAGCAGATGC TTGGCTTCGA TGATGGCGAG
GCTGCTGCGA TCCTACAGGC CGCCGCGAAG GAGACGCGGC CCTACATTCG TTGGCTTCCG
CTCCTTTGTG CGGCTTCAGG GGCTCGCGTG GGGGAGATGG CACAGCTTCG TGCAGAGGAC
GTGATAGTTC AGGATGGCAT TCCTGCCCTA TGCATCACGG CGGAGGCTGG GTCGCTGAAG
AACCTGAACT CGGAACGCGT CATCCCCCTC CACCCTGGCG TGATCGATGC TGGGTTCTTG
GAGTTCGTGA AGGGCAAGAA GGGGCCTCTG TTCTACAACC CAGTGAGGCG CAGGACAGAC
GCGAAGAAGC CTTCACACAA GATTGTCGCC AAGAACGTAG CAACGTGGGT GCAGGGGCTT
GGCTTGCAGG TTGGGCGGCA GCACCGCAAA GACCCGTCAC ACGCCTGGAG GCACCGCTTC
AAGACACTCG CGCGGGCTGC CAGGATTGAG GACAGTGTGG CCGATGCCAT TGTTGGACAC
GCACCCGGGA GCGTAGCCAA GGCTTACGGC ACGGTGACGC TGGCGACCAT GCACGAAGCT
GTGTCCCGCA TCCCTATTCC GAAGTGTTAG
 
Protein sequence
MPLQMARPMR RSGSSFHQLV QRIPADVAAK VRGMRLSIPI GEGEAHLVIS DKATDVRTSL 
RTRDPAVAKA RQAVAVAYLE KVWRSVREGP QRLTQRQTVA LAGEAYTALK TALEDDPGAP
ERWLRVEVDN LRAESGQYGR SALMIGSAEE KQARSIEERV GGFSDMVLAK HALNIDADSR
ARLNQELLAA LRQAGLVLMR HAQGDYRPDP DVSRFPDFQT SSSKATAVSL LDLFDGWAKE
RKPSQSTVDQ WRKHCEAFLA FIGKDDAGRV TKADVVAWKD TLVAAGGAPK TINDSKLAAL
RVAFTWGVEN VRVRSNPATG VAVRQKMQAG EQMLGFDDGE AAAILQAAAK ETRPYIRWLP
LLCAASGARV GEMAQLRAED VIVQDGIPAL CITAEAGSLK NLNSERVIPL HPGVIDAGFL
EFVKGKKGPL FYNPVRRRTD AKKPSHKIVA KNVATWVQGL GLQVGRQHRK DPSHAWRHRF
KTLARAARIE DSVADAIVGH APGSVAKAYG TVTLATMHEA VSRIPIPKC