Gene Franean1_3249 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3249 
Symbol 
ID5671623 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp3841378 
End bp3843627 
Gene Length2250 bp 
Protein Length749 aa 
Translation table11 
GC content69% 
IMG OID641242141 
ProductABC transporter related 
Protein accessionYP_001507561 
Protein GI158315053 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0410] ABC-type branched-chain amino acid transport systems, ATPase component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCGAGTG ACCAGGCGGT GTCGCCAGGC GGAGGGCCCA CCGAGGCCGC GGCCTCGGTC 
GGGTCCCCGG CGTCGGTCGC GGGGCTGGCC TCGGGACTGA TCGCCGCCGA GGCGGAACGG
CGAGAACAGC AGGCGGCAGG GCGGGAGGTC CTGTTCGCGG ACGAGCTGCT GCCGGGCGTG
GGTGACGAGC AGTTGTCATT ACGCGCGGGT CTGGCGGCCG GCGGCTCGAT GACGTTCCTG
ACGTTGGTGA CGTTGAGCGC GCTGGATGAG CTGGAGTCGG CGTCCGTCGG CGTTCTGGCA
CCGGATATCC GGGACTCGTT CGGCATCGGC AACGGTCTGA TGGTCTTCAT CTCGGCGGCG
TCGGGCGCTT TCCTGGTGCT GGGCGCGCTG CCGATGGGAT GGTTGGCGGA CCGGTGCCGG
CGCAGCCGCA TCATCGGTTG GGCGGCCGTG GCTTTCTCCG TCATGGTCTT CCTGTCGGGC
CTGGCGGCGA ACGCGCTGCT GTTCTTCCTG GCACGGTTCG GCGTCGGCGT CGCGAAGTCG
AGCAACAACG CGGTCCACGG CTCGCTGCTG GCGGACACGT ATCCGATCGG CATCCGGGGC
CGGATCTCGG CGGTGAACTA CGGGTCCGCC CGCGCGGCCG GGGCGTTGAG CCCGCTGGTG
GTCGCCGGTA TCGCCACCCT CGCGGGCGGC TCGGCGGGAT GGCGCTGGCC GTTCCTGGTG
CTGGGGTTAC CGGCGTTACT GATGGCGCTG CTGGCTTTCC GCCTGCCGGA ACCGGCACGG
GGTCAGCACG AGATGAAGTC GGTGCTGGGC GAGGTTCTGC ACGAGGCCGA CCCGATGCCC
ATCTCGGTCG AGGCGGCGTT CTCCCGCCTG ATGCAGATCC GAACCGTCAA AAGCGCGATC
CTGGCCTTCT CCGCGCTCGG CTTCGGCCTG TTCACCACAG GTGTCCTCGG CAACCTGTGG
GCAGAGGACC ACTACGGGAT GTCGACATTT CAGCGCGGTC TCATGGGCAC CCTCGGCGGG
ATCACGCTGT TGGTCTGCCT CCCGTTGGTG GCACCGCGCT ACGACCGGCT CTACCGCAGC
GACCCGGCGC GGGCGCTGCG GTTGCTGGGG CTGTGCATCG CGCCGATCGC GATCCTCCTG
CCGATCCAGT GGTTCATGCC CGGGTGGGTC GGGTTCATGC TGGCCGGCGT ACCTGGTGCC
GCCCTGACCT CGGTGGCCTT CTCCATGGTC GGCCCCGTCC TGCAGTCGGT GGTGCCCTAC
CGACTGCGAG GTCTGGGCGC CGCACTGGGC GCGGTGTACG TGTTCTTCAT CGGCGCCACC
GGCGGCGCGG TCCTGGCAGC CGTGCTGAGC GATGCCTACG ACCCCCGGGT CGCCGTCCTG
CTGATCGGGA TCCCCGCCCA CGCCGTCGGT GCGTACCTGC TGGTCCGCGG CGCCTCCTTC
ATCCGCAGCG ACCTGTCCCT CGTGGTCGCC GACCTGCGCG AGGAACTGGA CGAACACGAC
CGGCAGAAGG CGGACCCGGA GAACATCCCG GTGCTGCAGG TGAACGACAT CGACTTCTCC
TACGGTCAGG TCCAGGTCCT GTTCGACGTC GCCTTCGAGG TGAGACGCGG CGAGACCCTG
GCGCTGCTCG GCACCAACGG CGCCGGCAAG TCGACGATCC TGAAGGTCAT CTGCGGTCTG
GGCACCCCGT CGCGTGGGGT GGTGCGGCTT GGCGGCAGGA CGATGACCTA CGTCCCGCCG
GAACAGCGCG GCAGATACGG CGTCCACCTG CTACCCGGCG GCAAGGGCGT CTTCCCCGCC
ATGACGGTTC GGGACAACCT CGAGATGGCG GCGTTCCGGT TCCGCGCCGA CCAGGCCCGT
CGCGACGCAC GCTTCGGCTA CGTCCTGGAT CTTTTCCCTG ACCTGAAGGA CCGGCAGCGG
CAGTCGGCCG GCTCCCTGTC CGGCGGACAG CAGCAGATGC TCGCGCTCGC CATGGTCCTG
ATGCACGATC CGGAAGTACT GCTCATCGAC GAGCTCTCTC TGGGGCTCGC ACCCGTGGTC
GTGCAGGACC TGCTCACGGT GCTGGAGCGG CTGAAGGCCG ACGGCCTCAC GATCATCGTG
GTCGAGCAGT CCCTGAACAT CGCCCTCGCG ATCGCCGACC GCGCCGTGTT CCTCGAAAAG
GGCCAGGTCC GCTTCACCGG GCCGGCGCGT GAGCTCGCCG AACGCGACGA CCTCGCCCGC
GCCGTGTTCC TCGGCAAGGA AGGCGGCTGA
 
Protein sequence
MASDQAVSPG GGPTEAAASV GSPASVAGLA SGLIAAEAER REQQAAGREV LFADELLPGV 
GDEQLSLRAG LAAGGSMTFL TLVTLSALDE LESASVGVLA PDIRDSFGIG NGLMVFISAA
SGAFLVLGAL PMGWLADRCR RSRIIGWAAV AFSVMVFLSG LAANALLFFL ARFGVGVAKS
SNNAVHGSLL ADTYPIGIRG RISAVNYGSA RAAGALSPLV VAGIATLAGG SAGWRWPFLV
LGLPALLMAL LAFRLPEPAR GQHEMKSVLG EVLHEADPMP ISVEAAFSRL MQIRTVKSAI
LAFSALGFGL FTTGVLGNLW AEDHYGMSTF QRGLMGTLGG ITLLVCLPLV APRYDRLYRS
DPARALRLLG LCIAPIAILL PIQWFMPGWV GFMLAGVPGA ALTSVAFSMV GPVLQSVVPY
RLRGLGAALG AVYVFFIGAT GGAVLAAVLS DAYDPRVAVL LIGIPAHAVG AYLLVRGASF
IRSDLSLVVA DLREELDEHD RQKADPENIP VLQVNDIDFS YGQVQVLFDV AFEVRRGETL
ALLGTNGAGK STILKVICGL GTPSRGVVRL GGRTMTYVPP EQRGRYGVHL LPGGKGVFPA
MTVRDNLEMA AFRFRADQAR RDARFGYVLD LFPDLKDRQR QSAGSLSGGQ QQMLALAMVL
MHDPEVLLID ELSLGLAPVV VQDLLTVLER LKADGLTIIV VEQSLNIALA IADRAVFLEK
GQVRFTGPAR ELAERDDLAR AVFLGKEGG