Gene Franean1_5747 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_5747 
Symbol 
ID5674073 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp6982506 
End bp6984884 
Gene Length2379 bp 
Protein Length792 aa 
Translation table11 
GC content72% 
IMG OID641244600 
Productheavy metal translocating P-type ATPase 
Protein accessionYP_001510003 
Protein GI158317495 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG2217] Cation transport ATPase 
TIGRFAM ID[TIGR01494] ATPase, P-type (transporting), HAD superfamily, subfamily IC
[TIGR01525] heavy metal translocating P-type ATPase
[TIGR01511] copper-(or silver)-translocating P-type ATPase 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCACCA CCACCACCAC GACGCCGCAC GCCACCGACA CCCCGGTCAC CGAGATCGAG 
CTCGCGATCG GCGGGATGAC CTGCGCGTCC TGCGCCAACC GGATCGAACG CAAGCTGAAC
AAGCTCGACG GCGTCACCGC CACCGTCAAC TACGCCACCG AGAAGGCCCG GGTGAGCGCC
CCCGGCGGCG TCGACCCGAC CGTGCTGGTC GCCCAGGTGG AGGCAGCCGG CTACACCGCC
GAACTACCCC GGCCTCCGGC CGCGGACCTG GAACCAGCCG CAGGTGTGGC CGGCACCGAA
CCCGATCCGA CCCGGCCGCT GCGCCACCGC CTGATCACCA GCATCGTGCT CGCGGTTCCG
GTCATCGTGA TGGCGATGGT CCCAGCGCTG CAGTTCACGA ACTGGCAGTG GTTCTCGCTC
GTCCTGGCGA GCCCCGTCGT GACCTGGGCC GCCTGGCCCT TCCACCGCGC CGCCTGGACG
AACCTGCGAC ACGGCGCCGC GACGATGGAC ACGCTGATCT CCATCGGCGT CGGTGCGGCG
TTCGGCTGGT CGTTGTACGC GTTGCAGTTC GGCACAGCCG GGACGCCGGG GATGACGCAC
CCGTTCGAGC TGACGATCGC CCGCGGCGAC GGCACCGGCA ACATCTACCT GGAAGTCGCG
GCCGGCGTCA CCACCTTCAT CCTCGCCGGC CGGTACTTCG AGGCGCGCGC CAAGCGACAG
GCCGGCGCCG CGCTACGCGC GCTGCTCGAA CTCGGCGCCA AGGACGTCGC GGTCCTACGG
GACGGCGCGG AGGTCCGGGT CCCGATCGGG CAGCTGGCCG TCGGCGAGCG GTTCGTCGTC
CGGCCCGGCG AGCGGATCGC CACCGACGGC GTGGTCGACG AAGGCTCCTC GGCCGTCGAC
GCCTCGATGC TGACCGGCGA GTCCGTCCCC GTCGAGGTCG CCCCGGGCGA CGCCGTCACC
GGAGCGACGG TGAACGCCGG CGGACGACTG GTCGTGCGCG CCACCCGGGT AGGCGCCGAC
ACCCAGCTCG CCCAGATGGC CAAGCTCGTC GAGGATGCCC AGAACGGCAA GGCCGCCGCG
CAGCGGCTCG CCGACCGGAT CTCCGGGGTG TTCGTACCGG TCGTCATCAC CCTGGCCGCG
GCCACCCTGG CCTTCTGGCT CGGCGCGGAC GCCGGCCCGG ACGCCGCGTT CACCGCCGCG
GTCGCCGTGC TGATCATCGC CTGCCCGTGC GCCCTCGGGC TGGCCACCCC GACGGCACTG
CTGGTCGGAA CAGGCCGCGG AGCGCAGCTC GGCATCCTCA TCAAAGGCCC CGAAGTACTT
GAATCCACCC GACGCGTCGA CACGATCGTG CTCGACAAGA CCGGCACCGT CACCACGGGA
CGCATGAATC TGGTCGAGAT CCACCTCGCA GCCGCCGACG GCGACGAGAC CGGTGAGGTC
GACGAGACCG AGATGCTGCG GCTCGCCGGA GCGCTGGAGA ACGCGTCCGA ACATCCGATC
GCCGCCGCCA TCGCCCGCGC CGCCCGCGAG CGGGTCGGCG ACCTGCCCGG CGTCGAGGAG
TTCAGCAACC TCGAAGGTCT CGGCGTCCAG GGGATCGTCG ACGGGCACGC CGTGCTCGTC
GGACGAGCGA GCCTCCTGGA GAAGTGGAGC CAGCCGCTAC CGCCCGACCT CGTCACAGCC
AGGACAACCG CCGAGGAATC CGGCCGCACC GCCGTCGGTG TCGCCTGGGA CGGAAAGGCC
CGCGCCGTCC TCGTCGTCGC GGACGTCGCG AAACCCACCT CGGCACAGGC CGTCACCCAG
CTCCGCCGGC TGGGGCTCAC CCCGGTGCTA CTCACCGGCG ACAACACGAC CGTCGCCCGC
GCGGTCGCGA CCGAGGTCGG CATCGGCACC AGCCCGGACA CCCTCATCGC CGAGGTGCTG
CCCGCCGACA AGGTCGACGT CATCAAACGC CTGCAGGCCC AGGGCAAGGT CGTCGCGATG
GTCGGCGACG GCGTCAACGA CGCCCCCGCC CTCGCCCAGG CCGACCTCGG CCTCGCCATG
GGCACCGGCA CCGACGTCGC CATCGAGGCC AGCGATCTCA CCCTCGTCCG CGGCGACCTG
CGGGCCGCCG TCGACGCGAT CCGGCTCTCC CGCCGCACCC TGGCAACCAT CAGGGGAAAC
CTGTTCTGGG CTTTCGCCTA CAACCTCGCG GCACTACCCC TGGCTGCGGC AGGCCTGCTC
AACCCCATGA TCGCCGGCGC ATCCATGGCG TTCAGCTCGG TGTTCGTCGT CGCCAACAGC
CTACGACTGC GCCGCTTCAC CGCACAGGTC GACGTACCAC CCGCCACGAC GGCGCCCCTC
TCCCCATCCC GCCGGGACAC CGCGAAGACA TCCAGCTGA
 
Protein sequence
MSTTTTTTPH ATDTPVTEIE LAIGGMTCAS CANRIERKLN KLDGVTATVN YATEKARVSA 
PGGVDPTVLV AQVEAAGYTA ELPRPPAADL EPAAGVAGTE PDPTRPLRHR LITSIVLAVP
VIVMAMVPAL QFTNWQWFSL VLASPVVTWA AWPFHRAAWT NLRHGAATMD TLISIGVGAA
FGWSLYALQF GTAGTPGMTH PFELTIARGD GTGNIYLEVA AGVTTFILAG RYFEARAKRQ
AGAALRALLE LGAKDVAVLR DGAEVRVPIG QLAVGERFVV RPGERIATDG VVDEGSSAVD
ASMLTGESVP VEVAPGDAVT GATVNAGGRL VVRATRVGAD TQLAQMAKLV EDAQNGKAAA
QRLADRISGV FVPVVITLAA ATLAFWLGAD AGPDAAFTAA VAVLIIACPC ALGLATPTAL
LVGTGRGAQL GILIKGPEVL ESTRRVDTIV LDKTGTVTTG RMNLVEIHLA AADGDETGEV
DETEMLRLAG ALENASEHPI AAAIARAARE RVGDLPGVEE FSNLEGLGVQ GIVDGHAVLV
GRASLLEKWS QPLPPDLVTA RTTAEESGRT AVGVAWDGKA RAVLVVADVA KPTSAQAVTQ
LRRLGLTPVL LTGDNTTVAR AVATEVGIGT SPDTLIAEVL PADKVDVIKR LQAQGKVVAM
VGDGVNDAPA LAQADLGLAM GTGTDVAIEA SDLTLVRGDL RAAVDAIRLS RRTLATIRGN
LFWAFAYNLA ALPLAAAGLL NPMIAGASMA FSSVFVVANS LRLRRFTAQV DVPPATTAPL
SPSRRDTAKT SS