Gene GM21_1203 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_1203 
Symbol 
ID8136528 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp1407548 
End bp1408942 
Gene Length1395 bp 
Protein Length464 aa 
Translation table11 
GC content64% 
IMG OID644868817 
Producttype VI secretion protein, VC_A0114 family 
Protein accessionYP_003021022 
Protein GI253699833 
COG category[S] Function unknown 
COG ID[COG3522] Uncharacterized protein conserved in bacteria 
TIGRFAM ID[TIGR03353] type VI secretion protein, VC_A0114 family 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones170 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGTCCA TCGAGAGGCA GGTATTCTGG CACCAGGGGC TGTTTCTGCA GCCCCAGCAT 
TTCCAGCTCT CAGAGCGCTC GTTGCAGTCG CAACTGGCTC CGTACCAGTT GTGTCTCATG
CCTGACTTTT GGGGCGTGCA ACGGATGGAG ATCAGGTGCA CAGCCGGCGG GATACCTAGC
CTCGAAATTT CCGGCGCCTT CCTTTTTCCC GACGGCACCT ATGCCGTCGT CGCTGAGAAC
GCGCGGGCCG AGTCCAGGCC GGTCCTCGAA GAGGCAGTGG CGGGAAGGGA TTCCTGTACC
GTCTACCTTG GCTTGAAGAA ATGGAGCCCC GCCGGCCATA ACGTCACCAC CTTGCTCCCT
GGCGCCCCCC TCTCCAAGGT CGCTACGCGC TTCGTTGCCG AGTCCGACGC CGCGCCCCGT
GCCGATCTCC ACGCAGGGGG GGCTGAGGCG GAGGTGCGGC AGATGGCTTT GACCCTGCAG
CTTTTCTGGG AGAGCGAGCT GGAGTCGTTG GGGGATTACC TGCTCATCCC TGTCGCCCGG
CTCGCGCTGC GCGGCGAAAC CGCGGTGCTC TCCCGTGACT TTATCCCCCC CTGCATCACC
CTGTCCGGCT CATCCGCGCT CTTCGACCTG CTCCAGGAAA TCCGGGAACA GCTGGCCTCC
CGCTGCCGCT GGCTGGAGGG TTACAAGAAG GAACGGGGCA TTCAGGCCGC TGAGTTCGGT
TCCAAAGATC TGGTTTTCCT CCTGGCCCTC AGAACCGTGA GCCGGCACCT AGCGAGGCTT
AGCCACTGGA TCGAGGCGGG TGAGGTTCAC CCTTGGCAGG TTTACGGGCT TCTGGGAGAG
CTCGCCGCGG AACTGACCTG CTTCTCGGAG ACCACCGGCG CCTTCGGTGA GTCCGTGGCC
GACGGGCCAA GGCTCATGCC GCAGTACCGG CACAAAGATC TGGGGTGCTG CTTCCGGCTG
GCCCGCGATC TCATCGTCCA GCTGCTGAAC GAGGTGACCG CGGGGCCGGA ATACGCACTG
ACCCTCGCCT TCGACGGGAC CTGGTTCGCC TCCGATCTCA AGCCCGCGCA CTTCCAGGGG
CACAGCAGGT TTTACCTGGT GCTGAATACG AACGAGGATC CGAAACTCGT CCTTGCTTCG
GTCGCGACCG CAGCGAAGCT GACCGCGCGC GAACGGTTGC CGCTGTTGAT CTCCCAGGCG
CTCCCGGGGA TCGCCCTGGA GCATGTCTCC GATCCCCCCC GCGAACTGCC GCATCGCTCC
ACCTCCCTCT TTTTCAGCAT AGACAGCCGC TGCGATCAGT GGGAGCTGGT GCGGAAGTGG
AATAATATCG CGCTCAGCTG GGACCAGGCC CCAGCCGACC TCGAAGTGCA GCTCATGATC
GTTGCCAGGT CCTAG
 
Protein sequence
MMSIERQVFW HQGLFLQPQH FQLSERSLQS QLAPYQLCLM PDFWGVQRME IRCTAGGIPS 
LEISGAFLFP DGTYAVVAEN ARAESRPVLE EAVAGRDSCT VYLGLKKWSP AGHNVTTLLP
GAPLSKVATR FVAESDAAPR ADLHAGGAEA EVRQMALTLQ LFWESELESL GDYLLIPVAR
LALRGETAVL SRDFIPPCIT LSGSSALFDL LQEIREQLAS RCRWLEGYKK ERGIQAAEFG
SKDLVFLLAL RTVSRHLARL SHWIEAGEVH PWQVYGLLGE LAAELTCFSE TTGAFGESVA
DGPRLMPQYR HKDLGCCFRL ARDLIVQLLN EVTAGPEYAL TLAFDGTWFA SDLKPAHFQG
HSRFYLVLNT NEDPKLVLAS VATAAKLTAR ERLPLLISQA LPGIALEHVS DPPRELPHRS
TSLFFSIDSR CDQWELVRKW NNIALSWDQA PADLEVQLMI VARS