Gene GM21_3549 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_3549 
Symbol 
ID8138921 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp4110327 
End bp4111487 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content69% 
IMG OID644871168 
Productprotein of unknown function DUF214 
Protein accessionYP_003023328 
Protein GI253702139 
COG category[V] Defense mechanisms 
COG ID[COG0577] ABC-type antimicrobial peptide transport system, permease component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones152 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCAAAC GCCGCTGGCT CTCCCATATC GTCATCAGGG CGCTGGCCCA CAGAAAGGGG 
CGCACCGCGC TCCTCGTCGC CGTCCTCACC ATGGCGTCGA GCCTCGCCAC CGCCCTTTGC
ATCGTCTCCG CCTCGATGGG GGAGAGGGTG GCCGAGGAGA CCAGGCGCTA CGGAGCGAAC
CTCCTGATCC TCCCCGAGGC CGCCCGCATC GAGGTGGGAA GCGGCGCCCT CAGGTTCGGG
ACCGTCGGCG AGCCGGCCTA CCTGGACCAG GAACAGGTGG TCTCGGTGCT TGCCGCGAGC
GGCGCGGGGG AGGATTATTC CCTGCACCTC AAAGCGGCGC TCACCCTGAA CGGGGCCGAG
CTTCCCTGCG AAGGGGTCGA GTTCGACCGG GTGCGGCGGC TTGCCCCCTG GTGGCAGTTG
CGCGGCGCCT GGCCCAAAGC GGGCGAGGCG CTGGTGGGTA CCGACCTTGC GGCCCGCTAC
CGCCTTAAGC CCGGCGACAC GCTGGCGCTC GGCGGGAAGA GCGCGACCCT CAAGGTCGCC
GTCGCCGGTA TCGTCAGCAC CGGCGGCGAG GAGGACGGCG TGCTCTTCCT CCATTTGAAT
GAGCTGCAGC GGGAGGCGGG GCATCCGGGA GAGGTTAGCC TCGTGCGGTT GCTGGTAGAT
CCCAGCCGGG GGAGCGTCAA GGGGAAGGCG AAAGAGCTGC AGCCGCAGCT CTCGGGCGCG
GTGGTGAAGG AGTTGCGCCA GGTGGGGCGG ACCAGCGAGG AGCTCCTCGG GAAGGTACAG
CTTTTGATGC TGTTGGTGAC GCTGGTGGTC CTTGTCTGCG CCGGGAGCAG CGTCGCCGGG
ACCATGAGCG CCACCGTGCT GGAGCGCGGC AAGGAAATCG GGCTCATGAA GGCGATGGGG
GGGACCCGCT GGGACCTCTT GCGCATCTTC AGCGCCGAGG CGCTGCTTTT GGGGGGCGCC
GCGGGGATGA CCGGGTATCT GTTGGGGAGC GCCATCGCCC AGTTCGTGGC GCGGAGCGTT
TTCGCCGCCT CCGCCGGTTT CGCCCCGGCC TATTTCCCGG TGGCGCTGGG AGTGAGTCTC
TCGCTGGCGC TCGCCGGGAG CCTCGGCCCG CTCGTCTCCG TGTTCCGGCT CGACCCGGTG
CAAAGTCTGC GCGGAGAATA A
 
Protein sequence
MSKRRWLSHI VIRALAHRKG RTALLVAVLT MASSLATALC IVSASMGERV AEETRRYGAN 
LLILPEAARI EVGSGALRFG TVGEPAYLDQ EQVVSVLAAS GAGEDYSLHL KAALTLNGAE
LPCEGVEFDR VRRLAPWWQL RGAWPKAGEA LVGTDLAARY RLKPGDTLAL GGKSATLKVA
VAGIVSTGGE EDGVLFLHLN ELQREAGHPG EVSLVRLLVD PSRGSVKGKA KELQPQLSGA
VVKELRQVGR TSEELLGKVQ LLMLLVTLVV LVCAGSSVAG TMSATVLERG KEIGLMKAMG
GTRWDLLRIF SAEALLLGGA AGMTGYLLGS AIAQFVARSV FAASAGFAPA YFPVALGVSL
SLALAGSLGP LVSVFRLDPV QSLRGE