Gene GM21_0813 details


Gene Information

Locus tag: GM21_0813
Symbol:
ID: 8136129
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 967474
End bp: 968757
Gene Length: 1284 bp
Protein Length: 427 aa
Translation table: 11
GC content: 61%
IMG OID: 644868428
Product: integrase family protein
Protein accession: YP_003020642
Protein GI: 253699453
COG category:
COG ID:
TIGRFAM ID:
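
The coordinate and length fields above are consistent with each other, and the arithmetic can be sketched as follows. This is a minimal check, assuming the start and end coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 967474
end_bp = 968757

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# A CDS is read in non-overlapping triplets (codons).
codon_count = gene_length // 3

# Assumption: the last codon is a stop codon and encodes no amino acid.
protein_length = codon_count - 1

print(gene_length, "bp")      # 1284 bp
print(protein_length, "aa")   # 427 aa
```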


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value: 4.71518e-11
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
GTGCGTTACT ACCTTCGTGA GGGGCGAGGC TTCGCCCTGC GGGTCATGCC GTCCGGTCTC 
AAGAGCTTCG TCTATATCTA CGAACTGAAC AAGCGGAAAG GGTACCTGCT GCTCGGCCAC
TATCCAGGCT GCTCTCTCGC CGAAGCACGC ATCGCCTTCA ACCACGCTTT CAACCTGGTA
AAGAGAGGGA TCGACCCGCT TGGTCAGAAG AAAACCGTGG CAGAGGAGCG CGACAGGGCG
GTCAGGGAAG CCGCCCGGGA GGCGGAGGCC CGTGCCGTCG CGGCGGACCA CCTGGAGCGA
CTTACTTTTG AGACCCTGCT GAAAGACGGG ATCCCGGACG ACTTCACCCC CAAAACGGTG
GAACAGCTCG CCGCAGTCTG GATGATCAGG TACTCGAGGG CTAACCACAC GGAGCGCTGG
CAGGGAAGCG AACTCTGCTC TCTCAGGCTG CACATCCTTC CTGCCCTGGG AAAGGACGAT
ATAACCGCGG TACGGCGCAA GCACGCAGTG ACCTTCATCG AGCGGCTCGC CGCCAGATTC
CCAGGTGCCG CCCGAAACGC CATGAAGCTA TGCCGGCAGA TGTTCAAGTA CGCTTACCGC
CAGGAATGGG CGGAGATCCA GCCGTTCAAC GAGATAACGG AGTCGGTGCC GAAGATCGCC
CCGCGGGCCG ACGAGCGCCA CCTGGACGAC GACGAGATCG TCAGGGCATG GCGCGAGATC
AGCGACTCGG CGAGCTCCCT CTACGTCAAG CGGGCATTGA AGCTTATCCT CGTCACCGCA
CAGCGGCCGG GGGAGGTCGC CCAGATGCAC CGCAACCAGA TCAAGGAGAG ATGGTGGACC
ATCCCGGCGG AGGTGGCAAA GAATCGGAGG GATCATCGCG TGTACCTAAC GGACACGGCC
CTCGAGCTGG TCGGGGACGG CGCCGGTTAC ATCTTCCCCT CTGAGAAGGG AAAAACCCCC
TTCCTCTCCG CTAACAGCCT TTCCCAAGCG ATCAACCGCG GCTACCCGGC TGTGGAAGCC
ACGAAGCTAG TGGGGAACCA GACTATCAAG GCCGCGAAGA ATTGCTATTT CGGCATGAAG
CCCTGGTCGC CCCAGGACCT GCGCCGTACC GCCCGCACCA ACATGGCGCG GGTGGGAGTC
ATCGACGAGA TCGCCGAGGA GGTGGTGAAC CACAAGAAAT CAGGGATCGT CGGGGTCTAC
AACAAGTACC GCTACGATAA AGAGAAGGAA GTGGCCCTGA CCAAGTGGGA GCAGCTACTG
ATCGAGATAC TGAAGGGGGG CTGA
 
Protein sequence
MRYYLREGRG FALRVMPSGL KSFVYIYELN KRKGYLLLGH YPGCSLAEAR IAFNHAFNLV 
KRGIDPLGQK KTVAEERDRA VREAAREAEA RAVAADHLER LTFETLLKDG IPDDFTPKTV
EQLAAVWMIR YSRANHTERW QGSELCSLRL HILPALGKDD ITAVRRKHAV TFIERLAARF
PGAARNAMKL CRQMFKYAYR QEWAEIQPFN EITESVPKIA PRADERHLDD DEIVRAWREI
SDSASSLYVK RALKLILVTA QRPGEVAQMH RNQIKERWWT IPAEVAKNRR DHRVYLTDTA
LELVGDGAGY IFPSEKGKTP FLSANSLSQA INRGYPAVEA TKLVGNQTIK AAKNCYFGMK
PWSPQDLRRT ARTNMARVGV IDEIAEEVVN HKKSGIVGVY NKYRYDKEKE VALTKWEQLL
IEILKGG
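
The relationship between the gene sequence and the protein sequence can be illustrated with a short sketch. It assumes the record's translation table 11 (the NCBI bacterial, archaeal and plant plastid code), which shares the standard codon assignments but accepts alternative start codons such as GTG; an initiating codon is translated as M. The helper names `translate_cds` and `gc_percent` are hypothetical and not part of any database tool.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G);
# translation table 11 uses the same assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Start codons permitted by translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(dna):
    """Remove the spacing used in the listing and upper-case the bases."""
    return "".join(dna.split()).upper()


def translate_cds(dna):
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    protein = []
    for i in range(0, usable, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # an initiating codon is read as Met
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


# First row of the gene sequence listed above.
first_row = "GTGCGTTACT ACCTTCGTGA GGGGCGAGGC TTCGCCCTGC GGGTCATGCC GTCCGGTCTC"
print(translate_cds(first_row))  # MRYYLREGRGFALRVMPSGL
```

The output matches the first 20 residues of the protein sequence. Passing the full gene sequence to `gc_percent` would give a value to compare against the listed GC content.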