Gene GM21_1447 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_1447 
Symbol 
ID8136776 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp1704183 
End bp1705475 
Gene Length1293 bp 
Protein Length430 aa 
Translation table11 
GC content46% 
IMG OID644869060 
Productintegrase family protein 
Protein accessionYP_003021262 
Protein GI253700073 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones103 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGTCATATC CAACCCATCT GATACGTCGT AATGGTCATT ACCACTACAA GATCAAAGTC 
CCCGTAGATC TCCAGCAGCA CTTCCCCTGC ACCTTCATAA ACCGATCTCT TAAGACCACC
GATCTCCAGG AAGCCAAGAC CATCTTGGTA GGGATGGAGT ATAGAATTCA TAGGGTCTTC
ACCCTGTTGC GTACCGGTAT GCTATCTGAA GACATGACCA AGCAGGTTGT CAGTGATATC
GTGCCAGTAA GGGCAAAGTC GGTGGTAGCA AAAGGCTTAA TGTTGTCTGC TGTCATCAAG
CAATATGTCT TTGAAAAAGA GGCTCAATGG ACGCCTAAGA CCAAGATGGA AATGGCAGGG
GTCTTTAAGG TGGTTTTGGA TGTTCTTGGG GATGTCGATG TGAAGAGCCT CAACAGGCAA
GCTTTGCTAG ACATGAGGTC AACTCTGATG AAGCTCCCTT CCAATATGTA TAAGAGGTAT
CCTGGTCTGA CAGTGGGGCA GTTGTTGGAG ATGAGTGATA TCACGCCAAT GAGCATCAAG
TCGGTAAACA AGTATATGAA AGGAGTAGGG GCTGTATTAC GCTACTGTGC TAAAGAATGC
CTGATCGTGG TCAACTACGC CGATGGCCTC AAGATAATGG AAAAAAGCAA GCCTGACCAG
GAGCGGAGCA TATATGACAA TGCCGATATG AAGAGGATAT TCGATAACTT GCCACGCAAG
GAGAAATATC CTGAAAGGTA CTGGATACCC TTGATAGGCT GTTACTCGGG GATGCGCCTC
AATGAAATCT GTCAATTATA CGTTGAGGAC ATTCAACAGG TAGAAGGTAT ATGGTGCTTT
AATATAAATG GAGACAAAGA TAAGAGACTG AAGAACCAAA CAAGTGAGCG GATCATACCG
ATACATCCGA AGCTGATCGA GCTTGGGCTG ATAGAATACT GGCAGGTTGT CAAAGAGTCA
GGTGTTCCGA GACTATGGAT GGAGCTTACT TGGATGGATG TGAACGGTTA CAGTAACAAT
TTTGGTAAAT GGTATCAGCG GTTTAACAGG GAATGCGTGA CGTCCGATCC GAAAAAAGTG
TTTCATTCGT TCAGGCATGT TGTGACAGAT ACGTTGAAGC AGGCCGGTGT GCAGGACTCG
ATTATTGCTG AACTGGTAGG CCACAGTCAA GGCACCCACT CCATGACCAT GAGCCGGTAC
GGGAAGAGGT ATCAGCCGAA GGTGCTCCTG GAAGCAATGG TGCACCTAGA TTACGGCATC
GAAATATCCC CTATGCAAGT ACCTGGATTA TAA
 
Protein sequence
MSYPTHLIRR NGHYHYKIKV PVDLQQHFPC TFINRSLKTT DLQEAKTILV GMEYRIHRVF 
TLLRTGMLSE DMTKQVVSDI VPVRAKSVVA KGLMLSAVIK QYVFEKEAQW TPKTKMEMAG
VFKVVLDVLG DVDVKSLNRQ ALLDMRSTLM KLPSNMYKRY PGLTVGQLLE MSDITPMSIK
SVNKYMKGVG AVLRYCAKEC LIVVNYADGL KIMEKSKPDQ ERSIYDNADM KRIFDNLPRK
EKYPERYWIP LIGCYSGMRL NEICQLYVED IQQVEGIWCF NINGDKDKRL KNQTSERIIP
IHPKLIELGL IEYWQVVKES GVPRLWMELT WMDVNGYSNN FGKWYQRFNR ECVTSDPKKV
FHSFRHVVTD TLKQAGVQDS IIAELVGHSQ GTHSMTMSRY GKRYQPKVLL EAMVHLDYGI
EISPMQVPGL