Gene GM21_3502 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_3502 
Symbol 
ID8138874 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp4040570 
End bp4042153 
Gene Length1584 bp 
Protein Length527 aa 
Translation table11 
GC content61% 
IMG OID644871121 
Producthypothetical protein 
Protein accessionYP_003023281 
Protein GI253702092 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value2.7799500000000003e-32 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGGCCGGGC TTTTCGTCGC CGCCCTGCTG ATACGGCTAT ATTTCGTACC GTTTTTCAAG 
GTGATATCCG CCGACGGAGT TGGGTATGTA ACTGCCGCCC GGAGCCTCTC CCGGGGGAAT
CTAGGCGACC TCACCATCTA CGGGGTGGTC TACCCGAGCC TTACCGCGGC GCTGAATCTG
CTGACCGGCG ACATGGAACT GGCGGGGCGC TGTGTCTCGG CGTTCATGGG AAGCTTGCTG
GTCGTGCCGC TCTACCTGCT TGGGGTGGAA TTCTTCTCCA AGAGGGCAGG GCTTCTGGCC
TGCATCCTGG TCCTCGCCTG GCCCCCCCTG AGGATGTGGG CGGGCGAGGT GATGACGCAG
GGGACCTACA TAACGCTGAT GCTTGGCGGC GTGTACGCGA TGTGGGTGGC CTTCAGAAAA
GACTCCAGCC GCCTCTGCTT CGGTGCGGGC GTGCTGATGG CTTTTGCCTA CCTGACGCGC
CCCGAAGCTT TGGTCACCTT CCTTGCCGTA GGCGCAGCCC TGGTGGTTCC CGCACGGGTG
AAAGGGTTGT CCTGGAAAAG GATCGCCGGC CTAATCGCGG CGACAGGTGC CGGTTTCGCC
ATCCCGTTGA TCCCCTACGT TTTCCTCGTG CACAGCGTCA CAGGAAAATG GCAGCTTGCG
GGGAAGACCG CGAACACGCT CGCCGACGCG CTGAGCCAGT ACCTGCAGCG TCCCGACATG
AAGAACGAGG CCTCCTTCAA GGGGATCGGC GTACTCGACG TGATCCGGCT CTATCCGGAG
TTCCTTTGGG GGAACTTCCT CAAGAACCTG AAAGAAACCT TCCAGACCAT GGTCCCCACC
TACCTGTGGC TCCTCTCCTT CATCGGAATT ATCGGCTACG GCTGGAGCAG GGAAAAGTGC
GGCAGACAGC TGGTCCTCCT GGCGACCTTC GCGCCGTTAG CGGTGATCAT GGTCCTCTTC
TTCGTCGGGC CCGAGTACCT GCAGCCGTAT CTCCCCGTTC TCTTTCTCTG GGCCGCTTCC
GGTTTTCTGT TGCTGGAGGA GCGCCTGGCG TCGTCCTTGC GGCTGGATAG ATTCGAACTC
GTCTCCAGGA TGCGAAGGGG CATCCCGGCC TCCGCGCTCG TGGCGGGGTG GATCACGATT
TCTCTGCTCG TCGCACAGGT GCGGGAGATT AGCGACGAGC CGTACCACTA CTCCCAGGAC
GGCGGGCGGT ACGACCAGAA GAGGATCGGT CTGCGGCTCA AGAAGTTGCT CCCTCCGGGT
TCGCGGGTCA TGACCAGATG GGGGCGCATC ACCTTCTACT CCGAGATGGA GATGGTGATG
ATCCCGCAGG CGGGGTATCC GGAACTGCTG GATGCCATCC GCACCAGCAA GGTGAAGTAC
GTCATCGTCG ACGGGATGCT TACCGCCGCG CGTCCCCAGT TCGGCCTGCT CTACCGCCCC
CTGTTCGAGA CACCGGAGAC GATCGAGTAC AGTGAAAAGG AGGCAGGCGG GGAAGCCTAC
ATGCCCCTTC CCAACCTGAA GCTCATTTAC CTGCACAAGG ATCCTTCCAG CATCGGACTG
GCGGTGTACG AGGTGAAGTC GTGA
 
Protein sequence
MAGLFVAALL IRLYFVPFFK VISADGVGYV TAARSLSRGN LGDLTIYGVV YPSLTAALNL 
LTGDMELAGR CVSAFMGSLL VVPLYLLGVE FFSKRAGLLA CILVLAWPPL RMWAGEVMTQ
GTYITLMLGG VYAMWVAFRK DSSRLCFGAG VLMAFAYLTR PEALVTFLAV GAALVVPARV
KGLSWKRIAG LIAATGAGFA IPLIPYVFLV HSVTGKWQLA GKTANTLADA LSQYLQRPDM
KNEASFKGIG VLDVIRLYPE FLWGNFLKNL KETFQTMVPT YLWLLSFIGI IGYGWSREKC
GRQLVLLATF APLAVIMVLF FVGPEYLQPY LPVLFLWAAS GFLLLEERLA SSLRLDRFEL
VSRMRRGIPA SALVAGWITI SLLVAQVREI SDEPYHYSQD GGRYDQKRIG LRLKKLLPPG
SRVMTRWGRI TFYSEMEMVM IPQAGYPELL DAIRTSKVKY VIVDGMLTAA RPQFGLLYRP
LFETPETIEY SEKEAGGEAY MPLPNLKLIY LHKDPSSIGL AVYEVKS