Gene GM21_1687 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_1687 
Symbol 
ID8137018 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp1966142 
End bp1967578 
Gene Length1437 bp 
Protein Length478 aa 
Translation table11 
GC content61% 
IMG OID644869299 
Producthypothetical protein 
Protein accessionYP_003021499 
Protein GI253700310 
COG category[R] General function prediction only 
COG ID[COG3497] Phage tail sheath protein FI 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones102 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAACCA TCTACAAGAC ACCGGGTGTC TACATCGAAG AAATCCCGAA GTTTCCCCCT 
TCAGTCGCCC CGGTCGAAAC CGCGATTCCC GCGTTCATCG GTTACACGGA GAAAGCGGAC
AACTTGGTCC CAGGCGACCT CACCTTGAAG CCGACGCGCA TCTCCTCGCT GGTCGAGTAC
GAGAAGATGT TCGGCGGTGC GCAAAAGGAA GAGGACATAA ACGTCGACGT CCAGGAAACT
CAGGTCAACA GCGTGACGGT GGATCTGAAG GCAACGGCCA CGGTCACCGA AGCGGCCCGC
TCGAAGCACA TCATGTATTA CGCGATGCAG CTCTTCTTCG CCAACGGCGG AGGCCCCTGC
TACATCGTCT CGGTCGGCCC ATTCAAGGCG ACCTTCGGGG GCGCGCTGGT GGAAACGGAG
CTTCAGGCCG GGCTGGACAC CCTGGTGAAG AAGGACGAGC CGACACTGAT CGTCTTCCCG
GAGGCCCAAA GTCTGTCGAT CGCGGACTTC AAGACCCTGC ACGACGCAGC CCTCGCCCAG
TGCGCCGATC TTAAGGACCG GTTCGTGATC ATTGACGTCC ACGGCGACAG CATCTCCCTG
TCGGACCCTA ACGGGAACCT TTTGACCGCC GTCTCCAACT TCCGGACCAA CGGCATCGGG
ATGAACAACC TGAAGTACGG AGCGGCCTAC GCGCCCAACA TCGACACCGT TCTAGACTTC
CAGTACGACG ACAGCAAGGT CGACGTCACT ATCACCACCA ACGGCACCGC TGCGGCTCCG
GTGAAGCTCG ACACGCTGAA AACCGCCAAC AACCGCATCT ACGAGCAGGC CAAGGCCGGC
ATCAGGGACA TGATCTGCAA GATGCCTCCC TCCTCGTCAA TGGCCGGGAT CTACGCCGCC
GTAGACAACA GCCGCGGAGT CTGGAAGGCG CCGGCAAACC TGAGCGTCAA TTCGGTGATC
CAGCCGAGCA TCCAGTTCTC CAGCGTGGAG CAGGACCAGA TGAACGTCGA CCCGGTCGCC
GGCAAGTCGG TGAACGCCAT CCGCGCCTTC ACCGGGAAGG GCACCCTGGT GTGGGGGGCG
CGAACGCTAG CCGGCAACGA CAACGAGTGG CGCTACGTCA ACGTGCGCAG GCTCTTCAAC
TTCGTGGAAG AGTCGTGCAA AAAGGCGACG GAACCTTTTG TCTTCGAACC GAACGACGCC
AACACCTGGG TCCGGGTGCA AGGGATGATC GAGAACTTCC TTACGGTGAT CTGGAGACAG
GGGGCGCTGC AGGGGGTTAA ACCCGAGCAC GCCTTCTTCG TCGCCGTCGG ACTGGGGAAG
ACCATGACGG CCATCGACAT CCTGGAGGGG CGCATGATCG TCGAGGTCGG CTTGGCGGCG
GTGCGACCAG CGGAGTTCAT CATCCTCAGG TTCTCGCACA AGATGGCTGA ATCCTAA
 
Protein sequence
MATIYKTPGV YIEEIPKFPP SVAPVETAIP AFIGYTEKAD NLVPGDLTLK PTRISSLVEY 
EKMFGGAQKE EDINVDVQET QVNSVTVDLK ATATVTEAAR SKHIMYYAMQ LFFANGGGPC
YIVSVGPFKA TFGGALVETE LQAGLDTLVK KDEPTLIVFP EAQSLSIADF KTLHDAALAQ
CADLKDRFVI IDVHGDSISL SDPNGNLLTA VSNFRTNGIG MNNLKYGAAY APNIDTVLDF
QYDDSKVDVT ITTNGTAAAP VKLDTLKTAN NRIYEQAKAG IRDMICKMPP SSSMAGIYAA
VDNSRGVWKA PANLSVNSVI QPSIQFSSVE QDQMNVDPVA GKSVNAIRAF TGKGTLVWGA
RTLAGNDNEW RYVNVRRLFN FVEESCKKAT EPFVFEPNDA NTWVRVQGMI ENFLTVIWRQ
GALQGVKPEH AFFVAVGLGK TMTAIDILEG RMIVEVGLAA VRPAEFIILR FSHKMAES