Gene GM21_2155 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_2155 
Symbol 
ID8137491 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp2518224 
End bp2519897 
Gene Length1674 bp 
Protein Length557 aa 
Translation table11 
GC content64% 
IMG OID644869770 
ProductNa+/Picotransporter 
Protein accessionYP_003021965 
Protein GI253700776 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1283] Na+/phosphate symporter 
TIGRFAM ID[TIGR00704] Na/Pi-cotransporter 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones113 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTCCCCA CTTATTGGTC CTACATCATA GAGGGGCTGG GCGGGCTGGC GCTTTTCATC 
CTCGGCATGC GCACCATGTC GGAGGGGCTG CAAAAGGTGA GCGGGGAACG TCTCCGCAGG
CTTTTGGAGA AGGTGACGGG CAACCGCCTC ACTGCCCCTT TGGTCGGAAG CTGCCTCGCC
TCGCTGCTGC AGTCCGGCAG CGCCGCGTCG GTCCTGGTGG TCGGCTTCGT CAACGCCGGG
CTCCTCTCGC TGTACCAGGC CCTCGGGGTG CTTCTGGGGA CCGGCATCGG CACCACCCTC
GCCATCCAGA TCATCGCCTT CCGGGTCACG GCGCTGGCGC TTCCCGCCAT CACGGTCGGG
GTCCTTTTAA GCTTCTTCTC CAAAAGCAGG CGCTTGTCGC AACTTGGCGG GCTGCTCCTC
GGGGTGGGGC TCGTCTTTTT CGGCCTCTCC ATCATCGAGG GGGCATCGCT TCCCTTGAGC
GAAAGCGCCA TCATCTCCGG GATGCGCGAG GGGCTTCCCT CGATCCGGCT GGCGGCGGTG
CTTCTGGGGG CCCTCCTGAC CTTCCTGGTG CAGTCGGGAA GCGCCACCTT GGGGATCGTG
ATCGCGCTCG CCTCCGCTGG AGTCCTCTCC TATGACGCCG CCATCGCCAT GGTGATCGGC
GAGGTGGCGG GAGCGGCGCT GATCCCGCTC ATCGCCTCCG TCGGGGGGAG CCACACCGCC
AAAAGGGCCG TCATCATCTA CCTCGGCATC AGCTGGGGCG CCATCGCGCT CGGGCTCGTC
TTCTTCCCGC TTTTCCTGCG CGCGGTCAAT ACAGTCTCCC CCGGCGACCT GTCGCTTTTG
CATCAGCCCG GCGCCGACCC CCATGCGGCA GCCCAGGCGT TGAGGCCGTA CATCGCCAGG
CACCTGGCCA ACGCCCACAC CATCTTTACC GTCGCGTCGC TCTTGATCTT CCTACCCTTG
CTCGGCTTTT TCACCCGTTC CGCGGAAACG CTTCTCCCCG CCAGGCGCTC CGAAAGCGAC
CCGCGCCCAC GATTCATCGA TAACCGGGTG ATCAAGACGC CCACCATCGC GCTGGTGCAG
GCCTGGAGCG AACTCTCCCG CATGGGGGGG CTCGCCGCCG CCATGTACCG CGAACTGGTG
TCGCAGTTCG ACTCCTACAA CCCGAAGCTC GTCGCGGCGA TCAGGGACAA GGAACTGGTG
CTCGACGTGC TGCACCGGGA CATGTCCCAT TTCCTGGTGG CGCTTTCCAG GGAGACCCTT
TCGCTGGAGC GGGCCGTGGA AATACCTGCC ATGCTGCAGA TGGTGAACGA GATGGAGCAG
GTGGGGGACC AGACCGAGGC GGTGCTGAAC TACCTGGTGC GCAAGAAAGA GGACCGGCTG
CGCTTTTCCA GCTCGGCCAT GGACGAACTG AAGCGCTTCG CCACCAAGGT GGGGGAGGTC
GTTTCCCTTT GCGAGCGAGT CCTGAAAGGG GAGGGGGAGG AAGACCCGGC ACCCCTGCGC
CAAGAAGTGG CCCTGCTTCA GGAGGAACTG CAGGCGAGCC ACCTGCGCCG GCTTAAAGTC
GGCAAGTGCA GCATCGTGGC GGGGCTTCTC TACGGCGACA TGATCATCGC CTTTTGCAAG
ATCTCCGAAC TCTGCTTCTC CATCATCTCT CAGAAAAAAG GAATCGCCGC ATGA
 
Protein sequence
MVPTYWSYII EGLGGLALFI LGMRTMSEGL QKVSGERLRR LLEKVTGNRL TAPLVGSCLA 
SLLQSGSAAS VLVVGFVNAG LLSLYQALGV LLGTGIGTTL AIQIIAFRVT ALALPAITVG
VLLSFFSKSR RLSQLGGLLL GVGLVFFGLS IIEGASLPLS ESAIISGMRE GLPSIRLAAV
LLGALLTFLV QSGSATLGIV IALASAGVLS YDAAIAMVIG EVAGAALIPL IASVGGSHTA
KRAVIIYLGI SWGAIALGLV FFPLFLRAVN TVSPGDLSLL HQPGADPHAA AQALRPYIAR
HLANAHTIFT VASLLIFLPL LGFFTRSAET LLPARRSESD PRPRFIDNRV IKTPTIALVQ
AWSELSRMGG LAAAMYRELV SQFDSYNPKL VAAIRDKELV LDVLHRDMSH FLVALSRETL
SLERAVEIPA MLQMVNEMEQ VGDQTEAVLN YLVRKKEDRL RFSSSAMDEL KRFATKVGEV
VSLCERVLKG EGEEDPAPLR QEVALLQEEL QASHLRRLKV GKCSIVAGLL YGDMIIAFCK
ISELCFSIIS QKKGIAA