Gene GM21_1254 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_1254 
Symbol 
ID8136580 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp1462957 
End bp1464177 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content52% 
IMG OID644868868 
Productintegrase family protein 
Protein accessionYP_003021073 
Protein GI253699884 
COG category[L] Replication, recombination and repair 
COG ID[COG0582] Integrase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones88 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCATTGA CGGATGTAAA GGTGAGAACA GCGAAGCCCG GAGAAAAGCA AGTGAAGCTA 
TCCGATGGTG ATGGAATGTA TCTCCTTGTT ACTCCCAACG GCGGGAAGTG TTGGCGGCTC
AAATACCGCT TTGGGGGCAA GGAAAAAGTG TTGGCGCTTG GCACCTATCC CGAAATCTCA
CTTGTAGACG CACGCCAACG CCGACATGAA GCGAGAAAAC TTTTAGCTAA CGGTGTTGAC
CCTAACGAGA TCAAGAAGGC TCAGAAGGCC GCTACGATTA CCGAGACAGA AAATAGCTTT
GAAGTAGTGG CGCGGGAATG GCACCAGAAA TTCTCACCTT CTTGGTCAGG GGTTCACTCT
GACACGACAT TGCGCCGGTT ACAAGCTGAC GTGTTCCCTG CAATAGGGGC GCGGCCTATA
TCGGAGATCA AAGCGCCGGA TGCGTTGGCT ATGTTGCGGC GGATTGAATC GAGGGGGGCG
CTTGAGACAG CACATCGAAT TCGGACCATC TGCGGCCAAG TTTTCCGCTA TGCTGTGGCC
ACTGGTCGCG CCGAGAGGGA TTGCACTGCT GATCTTAAAG GAGCATTGCC ACCCTGTAAA
AAAAGCCACC TTGCTGCAAT AACGGACCCC AAAGCCGTGG CACCCTTGCT TCGGGCCATA
GATGGGTATC AAGGTACGAT TCCGGTGAAA TCTGCGTTGC AATTGGCTGC AATGTTTTTC
GTCCGCCCTG GGGAGTTGAG GCAGGCTGAA TGGTCAGAGA TAGACTTTGA AAATGCGCTA
TGGAATATTC CCGCCAGTCG CATGAAAATG AAAGAACCGC ACATTGTCCC GTTGGCAAGT
CATGCAATTG CAATTCTGAA GGAACTGCAG GCCCTCACCG GCAGAAGCCG TTTCCTTTTC
CCTTCCGGCA GGTCGTTCAC CCGTCCTATG TCCAACAATG CCATAAATGC AGCCTTGCGG
CGCATGGGTT TTGACAAGGA TGAAATGACC GGACATGGCT TCCGAGCTAT GGCGCGTACA
ATACTCGATG AAGTGTTGCA ATTCCGTCCT GATTTTATCG AGCATCAACT AGCACATGCC
GTTAAAGACC CCAACGGACG CGCCTACAAC CGGACGGCGC ACCTTGCTGA GCGTCGGAAG
ATGATGCAGA CATGGGCGGA GTATTTGGAT GATTTAAAAA AAGGCCCGAA GGTTTTACTA
TTTAAGAGAG CGGAAGGATA A
 
Protein sequence
MPLTDVKVRT AKPGEKQVKL SDGDGMYLLV TPNGGKCWRL KYRFGGKEKV LALGTYPEIS 
LVDARQRRHE ARKLLANGVD PNEIKKAQKA ATITETENSF EVVAREWHQK FSPSWSGVHS
DTTLRRLQAD VFPAIGARPI SEIKAPDALA MLRRIESRGA LETAHRIRTI CGQVFRYAVA
TGRAERDCTA DLKGALPPCK KSHLAAITDP KAVAPLLRAI DGYQGTIPVK SALQLAAMFF
VRPGELRQAE WSEIDFENAL WNIPASRMKM KEPHIVPLAS HAIAILKELQ ALTGRSRFLF
PSGRSFTRPM SNNAINAALR RMGFDKDEMT GHGFRAMART ILDEVLQFRP DFIEHQLAHA
VKDPNGRAYN RTAHLAERRK MMQTWAEYLD DLKKGPKVLL FKRAEG