Gene GM21_4063 details


Gene Information

Locus tag: GM21_4063
Symbol:
ID: 8139437
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 4646890
End bp: 4648215
Gene Length: 1326 bp
Protein Length: 441 aa
Translation table: 11
GC content: 52%
IMG OID: 644871679
Product: integrase family protein
Protein accession: YP_003023837
Protein GI: 253702648
COG category: [L] Replication, recombination and repair
COG ID: [COG4974] Site-specific recombinase XerD
TIGRFAM ID:
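
The gene and protein lengths listed above follow from the start and end coordinates. The sketch below reproduces that arithmetic, assuming the coordinates are 1-based and inclusive (an assumption, though it is consistent with the listed 1326 bp) and that the unspliced CDS ends in a single stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive start and end coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, excluding the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(4646890, 4648215)
print(length)                     # 1326
print(protein_length_aa(length))  # 441
```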


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 127
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCTCCT TCCTTAAGAT AAAAACCGTG ATCTTGGCGA ACGGAGGTCG GCGGCCGATG 
TTGATTGACC GGACAGACGG CATGCCGTTA GTTAACCCCA CCTTCTACAT CACCTCTATG
GTCTACCTCC CGGGACTTGA ATTGAACACC CAAAAACAGG TCCTCTCCGC GATCCAGTTC
CTCTACGAGT GGAGCAAGAG GAACAGAATC GACCTTGAAG AACGGTTCGG GTTGGGCCAC
TTCCTCGAGC TCCGAGAAAT AGAAAAGGTA TGCGTGGATA TTAGAATGGA TTTCCGCAAT
TACTGCTTTG ACCCTCTTGC GTTTGCTGGA GCCAGTCGGC CAATGCAGAG GACGAAAACG
ATTGTCCGGC TTCACAGGCC GCGCAAACCA AGATACCTTA ACAACATGTA TAACTGCGGC
GCGACCACAT CCATCAAGCT CTCGTATATC AAGAATTACC TCGACTGGTT GGCTGCCGAG
ACAATTGGGA GATCTTCCGC GAGGGAACCG GAATTTTCCT CAATGCAGAC CTCCCGCGCT
GAGATGGTCA AGTGGCTCAC GGAGCGAATC CCTTCGGCAG GGGAGTCGCC ACTAAAAAGA
GGACTTACGC CTGAGGCGCG GACGCGACTT CTCGACGTGA TCGACCCGAA GCATCCAGAC
AATCCGTTTA AAAGCGCATT CGTCCGTGAA CGCAACCGCC TGATCATTTT GTATCTCGAC
CGCCTCGGGA TACGGAGGGC TGAAGCGCTT CTGATAAAGT TGGGGAAATT TCTTAACGTA
TTTCCCTCTG CGGGATGCCA GGAGGGCTCC GTGGAGATTC GCGAGCACGT AAATGATCCT
GAGGATACAA GACGTAACAG GCCACAGCTG AAAACTGCTG AGCGTCCCCT ACCGATCGGC
ATGGAACTTT GCAGTTTGAC TCGGGACTTC ATAAACCTTT ACCGCAGCAA GATTCCGCGC
GCGCGCTGCC ACGGCTACTT ATTTGTATCA AGGAGCGGAG AGCCGCTGAC GCTGAGCAGT
CTAGACGATA TTTTTGCAAA GGTTAGAACT GTTGAAGGTA TACCTGACCT TATTTCCGCG
CACCTACTTC GTTACACTTG GAATGACAGG TTCTCCGAAT TTGCTGACCA GATGATCAAG
TCTGGTGAAT GGAAAAGCAA GGACGAAGAG GAAATTCGGC GTTTGCAGCA GGGTTGGAGT
CCAGATTCCA AAATGCCGGG CAAGTACAGT CGGCGGTTTC TTGAGAATAA GACACGGCAG
GTCAGCATTC ATCTGCAGGA AAACCTGTAC ACCGTCAAAA TTCCTGACCT AACACCGGAG
AAATAA
 
Protein sequence
MSSFLKIKTV ILANGGRRPM LIDRTDGMPL VNPTFYITSM VYLPGLELNT QKQVLSAIQF 
LYEWSKRNRI DLEERFGLGH FLELREIEKV CVDIRMDFRN YCFDPLAFAG ASRPMQRTKT
IVRLHRPRKP RYLNNMYNCG ATTSIKLSYI KNYLDWLAAE TIGRSSAREP EFSSMQTSRA
EMVKWLTERI PSAGESPLKR GLTPEARTRL LDVIDPKHPD NPFKSAFVRE RNRLIILYLD
RLGIRRAEAL LIKLGKFLNV FPSAGCQEGS VEIREHVNDP EDTRRNRPQL KTAERPLPIG
MELCSLTRDF INLYRSKIPR ARCHGYLFVS RSGEPLTLSS LDDIFAKVRT VEGIPDLISA
HLLRYTWNDR FSEFADQMIK SGEWKSKDEE EIRRLQQGWS PDSKMPGKYS RRFLENKTRQ
VSIHLQENLY TVKIPDLTPE K
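
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The following is a minimal sketch, not the database's own method: it uses the standard codon-to-amino-acid assignments, which translation table 11 shares for internal codons (table 11 differs in the start codons it allows, which does not matter here because this gene starts with ATG). The helper names `clean` and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first, second, third base).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the 10-letter block layout."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First line of the gene sequence and first two blocks of the protein sequence.
gene_start = "ATGAGCTCCT TCCTTAAGAT AAAAACCGTG ATCTTGGCGA ACGGAGGTCG GCGGCCGATG"
protein_start = "MSSFLKIKTV ILANGGRRPM"

print(translate(gene_start))                          # MSSFLKIKTVILANGGRRPM
print(translate(gene_start) == clean(protein_start))  # True
```

Passing the full gene sequence to `translate` in the same way should give the complete protein for comparison with the listing above.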