Gene GM21_2970 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_2970 
Symbol 
ID8138313 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp3451871 
End bp3453055 
Gene Length1185 bp 
Protein Length394 aa 
Translation table11 
GC content49% 
IMG OID644870568 
Producttransposase IS4 family protein 
Protein accessionYP_003022757 
Protein GI253701568 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones40 
Fosmid unclonability p-value0.0000012671 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCGACACT CCCTTAGGAC TTCCACCCTC CGTGCCCGTC TAAGATTCAA AAAGGCTTTT 
AAGCCCATGT ACGATCCGGT ACAAAGGTAC CTTAGCGTCA TTACTCCGCT TACCTCCAAA
GGGGATCGTC CATTGCAACT CTCTTTCGAA GACCAGCTTA AGGCCCTTAT CTACTTCCAC
CTTCACGAGT TCTCCTCCGG TAGGGAACTG TTACAAGCTC TGGAGCAGGA CGATTTCGCT
AAAGAGTGTG TCGCACCACC CAAGGGAATT AAGAAAAGCG CTTTCTTTGA AGCTGTCAAC
AACCGTGGCT TGGAACAACT AGCAGAGCTT TTTAAGCTTT TACTTAAGGA CGCTAAAAAC
GTCATTCCAG CCGAGTTTGC AGATATTGGA AATTTAGTTG CCATAGACGG TTCATATATC
GACGCCGTCA TGTCTATGGA TTGGGCGGAT TATTCAAGCA CCCACAACAA AGCCAAAGCC
CATGTTGCGT TTGATATAAA CCGTGGCATC CCGAAAGATT TAATCCTCAC CGACGGCAAC
CAGACCGAAC GCCAATTCGT AGAAAGGATG ATCGGCCCCG ATGAAACGGC TGTATTGGAT
CGAGGCTATC AGTGCAACGC TAACTTTGAT CAGTGGCAGG AAAACGAAAA AAAGTTCATC
TGCCGCATCC AAGCAAGATC TAACAAAAAA GTCATACGCG AAAACCCCAT CGCGCGGGGT
AGCATCATCT TCTACGATGC TGTTGTCCTT CTCGGAGCAC CATCTACTAG GGCGAAAAAA
GAGGTTCGCG TGGTTGCTTA CCGGGTTGAG GGCAAAGATT TCTGGATTGC GACTAACCGT
CATGATTTGA CTGCGCTGCA AATCGCTGAG GCTTACAAGC TGCGTTGGCA CATTGAGAGC
TTTTTCGCAT GGTGGAAACG ACATCTCAGC GTTTACCATC TCATCGCCAG GAGCCAGTAT
GGCTTAACAG TCCAAATACT CAGTGGGCTC ATCACCTACC TGCTTCTGGC GATGTACTGC
CAGCGAGAAC ACAACGAGCC AGTAAGTGTT CACCGTGTTC GGGAACTGCG GCATCAAATG
GCCCGCGACG CAGTCGCAAT GACATCACAA ACGCCGTCTC CCAAAAGGGC AAAACTGCAA
AGAAACACAC GATTGCTGAT GAAGCGACGT AAAGCAAAAA CCTAA
 
Protein sequence
MRHSLRTSTL RARLRFKKAF KPMYDPVQRY LSVITPLTSK GDRPLQLSFE DQLKALIYFH 
LHEFSSGREL LQALEQDDFA KECVAPPKGI KKSAFFEAVN NRGLEQLAEL FKLLLKDAKN
VIPAEFADIG NLVAIDGSYI DAVMSMDWAD YSSTHNKAKA HVAFDINRGI PKDLILTDGN
QTERQFVERM IGPDETAVLD RGYQCNANFD QWQENEKKFI CRIQARSNKK VIRENPIARG
SIIFYDAVVL LGAPSTRAKK EVRVVAYRVE GKDFWIATNR HDLTALQIAE AYKLRWHIES
FFAWWKRHLS VYHLIARSQY GLTVQILSGL ITYLLLAMYC QREHNEPVSV HRVRELRHQM
ARDAVAMTSQ TPSPKRAKLQ RNTRLLMKRR KAKT