Gene GM21_1242 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_1242 
Symbol 
ID8136567 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp1448604 
End bp1450253 
Gene Length1650 bp 
Protein Length549 aa 
Translation table11 
GC content61% 
IMG OID644868856 
Producthypothetical protein 
Protein accessionYP_003021061 
Protein GI253699872 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value8.74975e-30 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAACCAAA AGAATGGCAG TAGAGTGCTT ATGATGCTGT TTGTCTTTGC GATGACCCTG 
GTGTCCTTTC AGTTTTGGCG CACCGAGGCG CAGGCGCAAT CCTCTTATTT CACCAGCAGA
GGTTGCGTAA CTTGCCACGG CGCCCCCACG GTTACCACCT GTGCCGGGTG CCACTATCAC
AGCGGCACCT TGTCCGCCAC CACCAACAAG ACGACCTCCT ACGCCCCCGG CGAGACGGTG
ACTGTCACCC TGACCGCTTC AGGCGCACGC TCCGGCTGGA TCGGCGCACG CCTCTATAAC
CAGGCTGGCG TCGAAGTGGC GCGTTCCACG GGCGCCCAGA GCGGCATGGG TGGTGCGACA
ATCTATCCGG CTGTTCTCTC GGCTCCGGCC CCCGCCGCCG CAGGTACCTA TAGCTGGAGG
ATGGCTTATC TCGGCAATGA CCTCACGGGA TCCGGCGACG TGCACAGCGA GAAATCGGTC
AACGTTTCCG TAACCGTTGC CGCAGTCCCG GTCGCCGACA CCACCGCGCC GGTCGTAGCC
ACCTTCACCC TGCCGGCGAG CTCCACCAGC CTGAACGTTC CGGTCTCCGC CCTGAGCGCG
ACCGATAACG TCGCCGTCAC CGGCTACCTG GTGAACAAGG TCGCGAGCGC GCCGACCGCA
TCCGCAGCAG GGTGGAGCGC CACCGCTCCG ACAAGCGTCA CCGCCGTCGC CGGCAGCAAC
ACCTTCTACG CCTGGGCCAA GGACGCGGCC GGCAACATCT CGCTGGCAAA AAGCGCCAGC
GTCACCGTTA CCATCGAAAC GGCCGATGTC ACCGCACCGA CCGTCGTAGT TTCTACCCTC
GCAAACGGTT CCTACACCAA TAAAGCAACC CTCAACATCA GCGGCAACGT CAGCGATGAA
GGCGGCTTGC AGTCCTTCAC CGTCAACAGC CAGCCTGTAA TAGTAAACGC GGACGGTTCT
TTCAGCACAG CCTTCCCCTT GGTGGCCGGA GCAAACACCG TGACTATCGT TGCCACCGAC
GTGGCCGGCA ATCAGCAGAC CGATGTTCGT ACCATCAACT ATGATCCGAC CGCACCGGTG
CTTGCAGTCA CCGCCCCCGG CGACAACAGC ATCTCCGCCC AATCATTCAT AACGCTGACC
GGTACCATCA GCGAGACTTC CACTGTCACC GTCACCGGCA ATGATGGCAG CCAACAATCA
GCCGCCGTCA CTGGCAGCAA CTTCATCGCC ACCGCCAACC TCGTTGCCGG CGTCAACACC
ATCACCATCA CCGCAACCGA CCTGGCCGGC AACACCGCCA GCGCCAAGCG GACCGTAACC
TATGAAGGCG GAACGATGAC CATAGCTATC ACCAGGCCGA GCCAGGACAT CACCACCAGC
AGAAATTCCA TCGTTGTGGA GGGCAAGATC GTCGATGCAG TAGGCAAGAT CTCGGTAAGC
CTGCAGGTAA ACGGCCGCAT CTACTTTCCC AACGTCGACG AAAATGGTCT CTTCAAGCAG
GCACTCTTCT TCCAGAAGTC CGGTCTGTAC ACTATCCTCG TTACCGCCAA GGATGCTGCC
GGCAACAGCA GCACGGTGAC CCGCAACGTG ATCTTCCGCA AGTACGATGA CCACGACGAT
TACCATGGTG ATGAGCGCGA CGACGATTAA
 
Protein sequence
MNQKNGSRVL MMLFVFAMTL VSFQFWRTEA QAQSSYFTSR GCVTCHGAPT VTTCAGCHYH 
SGTLSATTNK TTSYAPGETV TVTLTASGAR SGWIGARLYN QAGVEVARST GAQSGMGGAT
IYPAVLSAPA PAAAGTYSWR MAYLGNDLTG SGDVHSEKSV NVSVTVAAVP VADTTAPVVA
TFTLPASSTS LNVPVSALSA TDNVAVTGYL VNKVASAPTA SAAGWSATAP TSVTAVAGSN
TFYAWAKDAA GNISLAKSAS VTVTIETADV TAPTVVVSTL ANGSYTNKAT LNISGNVSDE
GGLQSFTVNS QPVIVNADGS FSTAFPLVAG ANTVTIVATD VAGNQQTDVR TINYDPTAPV
LAVTAPGDNS ISAQSFITLT GTISETSTVT VTGNDGSQQS AAVTGSNFIA TANLVAGVNT
ITITATDLAG NTASAKRTVT YEGGTMTIAI TRPSQDITTS RNSIVVEGKI VDAVGKISVS
LQVNGRIYFP NVDENGLFKQ ALFFQKSGLY TILVTAKDAA GNSSTVTRNV IFRKYDDHDD
YHGDERDDD