Gene Franean1_4819 details


Gene Information

Locus tag: Franean1_4819
Symbol:
ID: 5673160
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 5752854
End bp: 5754713
Gene Length: 1860 bp
Protein Length: 619 aa
Translation table: 11
GC content: 74%
IMG OID: 641243675
Product: ABC transporter related
Protein accession: YP_001509091
Protein GI: 158316583
COG category: [R] General function prediction only
COG ID: [COG1123] ATPase components of various ABC-type transport systems, contain duplicated ATPase
TIGRFAM ID: [TIGR02323] phosphonate C-P lyase system protein PhnK
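
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by a CDS of the given length.

    Assumes the CDS is a whole number of codons and, by default,
    that the final codon is a stop codon that is not translated.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop_codon else codons


# Values taken from the record: Start bp, End bp.
length_bp = gene_length(5752854, 5754713)
protein_aa = expected_protein_length(length_bp)
print(length_bp, protein_aa)  # prints "1860 619"
```

Under these assumptions the result agrees with the listed Gene Length (1860 bp) and Protein Length (619 aa).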


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0993193
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACACGG TTCCCGATGT GGCCCCGGAA CCCGTGGCCC CGGAACCCGT GGCCCGGGAG 
CCTGTGACGG CGGGCCCTGT CGCGGCGGAG CGCGTGGCCG CGGAGCGGGA CGAACGGCCT
GCCCTGTTGG AGATCGACGG ACTGGAGGTC GACTACCGGC CACGCGGTGA GGTCGTGCAC
GCCGTGCGCT CGATCAGTCT CCGGGTCGCC GCCGGGGAGG TCGTGGCGAT CGTCGGCGAG
TCCGGTTCGG GGAAGTCCAC GACCGCGCAC GCGGTGCTGC GACTGCTGCC GCGTTCGGCC
CGGATCGCCG CCGGCCGGAT CCGGTTCGAC GGCCAGGACG TCACCGGCCT GAGCGAGCGC
CGTTTCCGGG CTTTGCGCGG TCGGGACATC GCGCTCATCC CGCAGGAGCC GACGACGTCC
CTCAACCCCA CCCAGCGGAT CGGTACGCAG GTCGCCGAGG TGCTGATCGT CCACGGCCTG
GCCGACCGGC GCACGGCCGG GCGGGAGGCA CTGCGGGTGC TGGCGGACGC GGGCATGCCC
GAGCCGGAAC TGCGGGCCCG CCAGTACCCG TCGCAGCTCT CGGGCGGCCT GTGCCAGCGG
GCCCTGATCT CCGTCGCCAT CGCGGCGAAG CCGCGCCTGA TCGTCGCGGA CGAGCCGACG
AGCGCCCTGG ACGTCACCGT TCAACGCCGG ATCCTCGACC ATCTGACCAG CCTCACCCGG
GACGCGGGGA CAAGCGTCCT GCTGGTGACG CACGATCTGG GGGTGGCCGC CGAGCGGGCG
GACCGGATCC TGGTCATGGC GGACGGCCGG ATCGTCGAGG AGGGCGCCCC GGAACGGATT
CTCGACCAAC CGGAGCACCC GTACACCCGG GCGCTGATCG CGGCCGCGCC GAGCCTCGAA
CCCCGCCCGC GACTGGTACC GCGCCCGACG CCGGCAACCC CGGCCGCGGC GCCCCTCTCG
ACAGCACCCT CCACGGCAAC GCCCTCCACG GCAGCACCCT CGACGGCAAC GCACCCCGCG
ACAGCGACGG CCGTGGAGCC ACCGGCCGCG GCACCGCCGG CCGAGGTTCC GCACCTGCGC
GTCGACGACC TGGTCAAGGA GTTCCGGCTG CCCCGCTCCG GGACCTCCGC GCGGGCGGTG
CGCGCGGTGG ACGGCGTCAG TTTCACCATC CAGCGCGGAC GGACCTTCGC GCTGGTGGGC
GAATCCGGCT CGGGGAAGTC GACGGCCGCT CGGCTCGTGC TGCGGCTGAC CGAACCCACC
TCGGGGCGGA TCACGCTCGC CGGCGAGGAC ATCACCCACA CCCGCGGCGA GGAGCAGCGG
GCGCTGCGGC GCCGGATGCA GGTCGTCTAC CAGAACCCGT ACGCCTCGCT GAACCCGCGG
TTCACCGTCG AGCAGATCAT CACCGACCCA CTGGCCTCGT TCGGCGTCGG CGCCCGGGCC
GAGCGCCGGC GGCGGGCCGC CGAGCTCGTC GATCTCGTCG CGCTGCCCGC GGCCATGCTG
ACGCGGCGTC CCGCCGAGCT CTCCGGCGGC CAGCGCCAGC GGGTCGCGAT CGCGCGGGCG
CTGGCGCTGC GCCCGGAGCT CGTCGTCTGC GACGAGGCCG TCTCCGCGCT GGACGTCTCG
GTGCAGGCGC AGATCCTCGA CCTGCTGGGG CGGCTGCAGA CCGAGCTGGG CGTCAGCTAT
CTGTTCATCT CGCACGACCT CGCGGTCGTC CGGCGGATCG CCGACGGGGT CGGCGTCATG
CGCCGCGGCC GGCTGGTCGA GAGCGGCAGC ACCGAAGCGG TCTTCACCAG CCCCGCCGAC
CCGTACACGC GTGAGCTGCT GGCGTCCATT CCCCGCCGCC GGCCGAAACC GCCCAGCTGA
 
Protein sequence
MNTVPDVAPE PVAPEPVARE PVTAGPVAAE RVAAERDERP ALLEIDGLEV DYRPRGEVVH 
AVRSISLRVA AGEVVAIVGE SGSGKSTTAH AVLRLLPRSA RIAAGRIRFD GQDVTGLSER
RFRALRGRDI ALIPQEPTTS LNPTQRIGTQ VAEVLIVHGL ADRRTAGREA LRVLADAGMP
EPELRARQYP SQLSGGLCQR ALISVAIAAK PRLIVADEPT SALDVTVQRR ILDHLTSLTR
DAGTSVLLVT HDLGVAAERA DRILVMADGR IVEEGAPERI LDQPEHPYTR ALIAAAPSLE
PRPRLVPRPT PATPAAAPLS TAPSTATPST AAPSTATHPA TATAVEPPAA APPAEVPHLR
VDDLVKEFRL PRSGTSARAV RAVDGVSFTI QRGRTFALVG ESGSGKSTAA RLVLRLTEPT
SGRITLAGED ITHTRGEEQR ALRRRMQVVY QNPYASLNPR FTVEQIITDP LASFGVGARA
ERRRRAAELV DLVALPAAML TRRPAELSGG QRQRVAIARA LALRPELVVC DEAVSALDVS
VQAQILDLLG RLQTELGVSY LFISHDLAVV RRIADGVGVM RRGRLVESGS TEAVFTSPAD
PYTRELLASI PRRRPKPPS
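
The sequences above are printed in space-separated blocks of 10 characters, so they need to be joined before any downstream use. The following is a minimal sketch of that clean-up together with a GC-percentage calculation. The helper names are hypothetical, and only the first row of the gene sequence is used here as sample input.

```python
def normalize_sequence(text: str) -> str:
    """Remove all whitespace (block separators, line breaks) and upper-case."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First row of the gene sequence as printed in the record.
first_row = "ATGAACACGG TTCCCGATGT GGCCCCGGAA CCCGTGGCCC CGGAACCCGT GGCCCGGGAG"
seq = normalize_sequence(first_row)
print(len(seq), round(gc_percent(seq), 1))
```

Applying the same two functions to the full gene sequence should give a value comparable to the GC content field, assuming that field was computed over the gene sequence alone.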