Gene Franean1_4888 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4888 
Symbol 
ID5673228 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp5864651 
End bp5867179 
Gene Length2529 bp 
Protein Length842 aa 
Translation table11 
GC content71% 
IMG OID641243743 
Producthypothetical protein 
Protein accessionYP_001509159 
Protein GI158316651 
COG category[M] Cell wall/membrane/envelope biogenesis
[V] Defense mechanisms 
COG ID[COG0577] ABC-type antimicrobial peptide transport system, permease component
[COG4591] ABC-type transport system, involved in lipoprotein release, permease component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.355305 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.171205 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGCGCG CCACGCTGCG GAGCCTGCTG GCCCGCAAGG TACGGCTTGT CCTGTCCGCG 
CTCGCGGTCG TCGTGGGTGT CAGCTTCGTC ACCGGCACCC TGGTCCTCAC CGACACGCTG
AACCGGACGT TCGACACCCT GTTCACCGAC ATCAACAAGA ACGTCAGCGT CTCCGTGCGG
ACCGTCAACG CGGTCGGCAC GGGCGACCAG GCCGACCGGA AGCCGGTGCC CGCCACACTG
GTCCCCACCG TCGCCAAGGT GGACGGAGTC ACCGCGGCGA CCGGGACGGT CCGGGGCCAG
GCGGTCCTGA TCGACCCCGC GTCGGGCGAC CCGCTCAACA GCGGCGGCGC ACCGGGCATC
GGAACGAACT GGACCGGCGG CACCGCGACG GGGTCGGAGG AGATCGCCGA CGGCCGGCCA
CCGAACGGCC AGGAGATCGC GGTCGACAAG TCGACCGCGG ACAAACACCA CCTCACGCTC
GGCCAGCGGA TCTCGGTGCA GACCCGCCTC CAGCCCGAGG AGTTCACCCT CGTCGGCACC
TTCCGCATCG GCGGGCAGGA CAGCCTCGGC GGCGCCGCCG TCACCACGTT CGACACGGCG
ACGGCGCAGC GGCTGCTGCT CGCGCCCGAC CAGTTCACCT CGATCAGCCT CGCCGCCGCC
AACGGCGTCT CGCAGGAGCA GCTGCGGGAC AGGGTCGCCG CGGTGCTCCC CACCGGGGTC
GAGGCGATCA CCGGCACCCA GCTCGCCGAG GAGGGCGCGA GCGCGATCCA GGACGCCGTC
AGCGGGTTCT CGACCTTCCT GCTGGTCTTC GCCGGCATCG CGGTGTTCGT CGGCGCCTTC
ATCATCTTCA ACACGTTCAC GATGCTGGTC GCGCAACGCG TCCGTGAGCT GGCGCTGCTG
CGCGCGATCG GCGCGAGCCG GGCCCAGGTG CAGCTCTCGG TCCAGATCGA GGCCCTGATC
GTCGGCTTCA TCGGCGCCAC GGCCGGTCTC GCTCTGGGCG CCCTGCTGGC GATGGGGCTG
CGGGCGGCCG TCGGCGCGTT CGGGATCTCG CTGCCGTCCG GCCCGCTGGT CTTCCAGCCT
CGGACCTTCC TGCTCGCCTA CGGCGTCGGC CTGGTCATCA CCGGTCTCGC GGCGTTCGTG
CCGGCCCGCA AGGCGGCGTC GGTGCCGCCG GTGGCGGCGA TGCGCGAGAC GTACGTGCTG
CCCACCAGGT CGCTGCGCAC CCGGGCGCTC GGCGGCGGCG CGCTGACCGT GCTGGGAGTG
ATCTGCCTGG TGGCGGGCCT GGCCCGCGGG GAGAATCCGA ACACGAGCAT GGTCGGCGCC
GGCGCGGCGT TCGTCTTCCT CGGCATCGCC ACTCTCTCGC CGCTGCTGGC CCCGCCGATC
ACCCAGGTCG TGGGGATTCC GCTGCGCGCG ATGTTCGGGA CGACCGGCCG GCTCGGCCAG
GAGAACGCCA TCCGCAACCC GCGGCGCACC GCCTCCACCG CGTCAGCGCT GATGATCGGT
CTGGCCCTGG TGAGCGCGTT CGCCGTGCTC GGCCAGTCCA TCAAGGAGTC GGTACGCGAG
ACGGTGTCGG AGAGCCTGGG TGCCGACTTC TACATCGCCA CATCCAACTT GGCCCCCTTC
AGCCCGCAGG TGGCCGCGGG GCTCCAGGGC AAGCCAGGCG TGGCGGTCGC CACCGGCATC
CGGGGCGGGG CGGTCAAGAT CGGCGACACG GACTCGTCGG TCCTCGCGGG CGATCCGGCC
GGCCTGCTGC AGGTCCTCTC CATCAAGCAG GTGGACGGCG ACGTCAACGC GCTCGGCGCG
GGTTCCCTCC TGGTCGACGA GGCGACGGCG GCCGAGCGGG GGCTGCGGGT CGGCGCGCCC
GTCCCGGTCA CGTTCGCGGA CGGCCCGACC GAGCTGACCC TGGTGGGCAC GTACGAGAAG
AGCGCGATCG CCGGCCCCGC GATGATCGCG ACCAGCGAGT TCGAGAAGCA CTCGAACAAC
AACCTCGACC TGTTCGTGAT GCTCAAGCTG GCCGACGGCG CCGACCCGGC CGCGGTCCGG
GCCGAGATCG ACAAGGTGAT CAAGCCGTTC GGCAGCGTCG AGGTCCGCGA CCAGTCGGAG
TTCGTGGCCC AGCAGGAGCA ACAGGTCGAC CAGCTGCTCG GGTTCGTGTA CGTCCTGCTG
GCCCTGGCGG TGGTGATCGC CCTGTTCGGC ATCGTGAACA CCCTAGCACT CTCGGTGATC
GAACGCACCC GGGAGATCGG GATGCTCCGG GCGATCGGCA TGACCCGGCA ACAGATGCGG
ATGATGGTCA TCGTCGAGTC GATGATTATC TCGGTCTTCG GTGCGGTGCT CGGTGTGCTG
GTCGGCAGCT TCTTCGGCTG GGCGCTGACC GGGGCGCTGA AGAACCAGGG GGTGACGACC
TTCGCCTATC CCGTCGGAAC GATCATCGCC GTGATGATCG CCGGCGCCAT CATGGGCGTG
CTCGCCGCGG TCTTCCCGGC GCGTCGAGCC GCCAGGATGG ACATCCTGCG GGCGATCGCC
ACGACCTGA
 
Protein sequence
MLRATLRSLL ARKVRLVLSA LAVVVGVSFV TGTLVLTDTL NRTFDTLFTD INKNVSVSVR 
TVNAVGTGDQ ADRKPVPATL VPTVAKVDGV TAATGTVRGQ AVLIDPASGD PLNSGGAPGI
GTNWTGGTAT GSEEIADGRP PNGQEIAVDK STADKHHLTL GQRISVQTRL QPEEFTLVGT
FRIGGQDSLG GAAVTTFDTA TAQRLLLAPD QFTSISLAAA NGVSQEQLRD RVAAVLPTGV
EAITGTQLAE EGASAIQDAV SGFSTFLLVF AGIAVFVGAF IIFNTFTMLV AQRVRELALL
RAIGASRAQV QLSVQIEALI VGFIGATAGL ALGALLAMGL RAAVGAFGIS LPSGPLVFQP
RTFLLAYGVG LVITGLAAFV PARKAASVPP VAAMRETYVL PTRSLRTRAL GGGALTVLGV
ICLVAGLARG ENPNTSMVGA GAAFVFLGIA TLSPLLAPPI TQVVGIPLRA MFGTTGRLGQ
ENAIRNPRRT ASTASALMIG LALVSAFAVL GQSIKESVRE TVSESLGADF YIATSNLAPF
SPQVAAGLQG KPGVAVATGI RGGAVKIGDT DSSVLAGDPA GLLQVLSIKQ VDGDVNALGA
GSLLVDEATA AERGLRVGAP VPVTFADGPT ELTLVGTYEK SAIAGPAMIA TSEFEKHSNN
NLDLFVMLKL ADGADPAAVR AEIDKVIKPF GSVEVRDQSE FVAQQEQQVD QLLGFVYVLL
ALAVVIALFG IVNTLALSVI ERTREIGMLR AIGMTRQQMR MMVIVESMII SVFGAVLGVL
VGSFFGWALT GALKNQGVTT FAYPVGTIIA VMIAGAIMGV LAAVFPARRA ARMDILRAIA
TT