Gene Franean1_3752 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3752 
Symbol 
ID5672117 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp4443683 
End bp4445476 
Gene Length1794 bp 
Protein Length597 aa 
Translation table11 
GC content73% 
IMG OID641242633 
ProductABC transporter related 
Protein accessionYP_001508053 
Protein GI158315545 
COG category[R] General function prediction only 
COG ID[COG1123] ATPase components of various ABC-type transport systems, contain duplicated ATPase 
TIGRFAM ID[TIGR02323] phosphonate C-P lyase system protein PhnK 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGGCCG GAACGAGCCA ACTGACGAGC CCGCCGACAA GCCCGCTGGT CAGCCCACTG 
GTGACGATCC GGGATCTCCA CGTCGGCTAT GTCCGCCGCC GCCGGACGGC GCCCGCCGTG
CGCGGCGTGG ACCTGTCGAT CGGGTCCGGG GAGATCGTCG CCCTCGTCGG CGAGTCGGGC
TCGGGCAAGT CCACGATCGC GAACACGATC ATCGGCCTGC TGCCCGACAA CGCCCGCGTC
ACCGCCGGCA GCGTCCTCCT CCACACCGGC CCGAGCCCCG ACGATGTCGA CGGCCCGGCG
CTCGACGACC GTCTGGACCT CGACGAGCGC GGCCAGCGGC CGGTCGAGAT CGTCGGGGCC
AGAGAGAAGG TCCTGCGTGA GCTGCGCGGC CGGGTCGTCG GGCTGGTGCC GCAGGACCCG
ATGGTCGGGC TCAACCCCAC GCGCCGCATC GGCGGGCAGG TCGCCGAGGC GATCCGGCTG
CGCGGGGTGA GCGGCGACGA TCTCGACGCC GAGGTGCTCG AGTTCCTCGC GCAGGCCGGC
GTCGACAACC CGCAGCTGCG CGCCCGCCAG TACCCGCACG AACTGTCCGG TGGCCTGCGC
CAGCGCGTGC TCATCGGGAT CGCCCTGGCC GGCCGGCCCC GTCTGATCAT CGCCGACGAG
CCGACCAGCG CCCTCGACGT GACGGTGCAG CGCAGGATCC TCGACCACCT CGAAGGGCTC
GTCCGCGAGT CCGGGATCTC GCTGCTGATC ATCACGCACG ACCTCGCCGT CGCGGCCGAC
CGCGGTGATC GCGTGGTCGT CCTGCAGGAC GGGGAGATCG TCGAGCAGGG CCCACCGGGC
GACATCCTGG TGAGCCCGGC GCAGGACTAC ACCCGGCGAC TCATCGCGGC CGCGCCCGGT
CTGGCCCACG GCGGCCAGAT CGTGCCCCGC TTCCAGCTTC CACCCGAGCA GGTGGCCACC
CCGGACCCGG TGATCCGGCT GGAGAACGTC TCCAAGCGCT ACCCACTGCC GGGCCGGCAG
GCCGGCTCGT TCCTCGCCCT GGACGGGGTG TCGGTCGAGG TGGCCCGCGG GCGCACGCAC
GCGCTCGTCG GCGAGTCCGG CTCCGGCAAG ACCACGATGC TGCGCATCGC GTTCGGCTTC
GAGCAGGCGA CCGACGGACG GGTCAGCCTC GACGGCGTCG ACATCACCGG CCTGGGCTGG
AAGCAGACCC GGCTGCTGCG CCGCAAGGTC CAGCTCGTCC ACCAGAACCC GTACGCCTCG
CTCGACCCGC GCTTCACCGT CGGGCAGAGC ATCACCGAGC CGCTGGTGTC GTTCCGGATC
GGCGACCGCC GCACCCGCAC GGCCCGCGCG CTGGAGCTGC TCGACCAGGT GGCGCTGCCC
GGCTCCTACC TCGACCGGCT GCCGGCGGAG CTGTCCGGCG GCCAGCGCCA GCGCGTCGCC
ATCGCCCGCG CGCTCGCCCT CTCCCCCGAG CTGGTGCTGC TCGACGAGCC CGTGTCGGCG
CTCGACGTCT CCGTGCAGGA ACAGATCCTC GCGCTGCTGA CCGACCTGCA GGAGCGCCTC
GGCCTCAGCT ACCTGTTCGT CTCGCACGAT CTGGCCGTCG TCGCGCGGAT CTCGCACACG
GTGTCGGTAC TCAACCGCGG CCGGCAGGTC GAGTCCGGCT CCGTCGCCGC GGTATTCGGC
TCGCCGCGCA GCGAGTACAC CCGGGAGCTG ATCGACGCCG TCCCCGGCCA GCGCGGCGCC
ACCCGCCAGG CCGGCTCCCG GGACGACGAT GTGCGGGAGG TCGCACGGAC CTGA
 
Protein sequence
MTAGTSQLTS PPTSPLVSPL VTIRDLHVGY VRRRRTAPAV RGVDLSIGSG EIVALVGESG 
SGKSTIANTI IGLLPDNARV TAGSVLLHTG PSPDDVDGPA LDDRLDLDER GQRPVEIVGA
REKVLRELRG RVVGLVPQDP MVGLNPTRRI GGQVAEAIRL RGVSGDDLDA EVLEFLAQAG
VDNPQLRARQ YPHELSGGLR QRVLIGIALA GRPRLIIADE PTSALDVTVQ RRILDHLEGL
VRESGISLLI ITHDLAVAAD RGDRVVVLQD GEIVEQGPPG DILVSPAQDY TRRLIAAAPG
LAHGGQIVPR FQLPPEQVAT PDPVIRLENV SKRYPLPGRQ AGSFLALDGV SVEVARGRTH
ALVGESGSGK TTMLRIAFGF EQATDGRVSL DGVDITGLGW KQTRLLRRKV QLVHQNPYAS
LDPRFTVGQS ITEPLVSFRI GDRRTRTARA LELLDQVALP GSYLDRLPAE LSGGQRQRVA
IARALALSPE LVLLDEPVSA LDVSVQEQIL ALLTDLQERL GLSYLFVSHD LAVVARISHT
VSVLNRGRQV ESGSVAAVFG SPRSEYTREL IDAVPGQRGA TRQAGSRDDD VREVART