Gene Franean1_3223 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3223 
Symbol 
ID5671599 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp3805722 
End bp3807443 
Gene Length1722 bp 
Protein Length573 aa 
Translation table11 
GC content74% 
IMG OID641242117 
ProductABC transporter related 
Protein accessionYP_001507537 
Protein GI158315029 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0410] ABC-type branched-chain amino acid transport systems, ATPase component
[COG0411] ABC-type branched-chain amino acid transport systems, ATPase component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGGCG GCCTTACGGA AACGGGGGAG ACCGGCCAGG CGGGTGACGA CGTAGCGCCC 
GTCACCGGCT CGTCCGGACC AGCCCGGCTG GAGTGTGTGG GTGCCACCGT GCGTTTCGGC
GGCCTCGTTG CCGTCGACTC GGTGGACCTG ACTGTCCCGC GGGGCTCGAT CGTCGGGCTG
GTGGGGCCCA ACGGGGCCGG GAAGAGCACC CTGTTCGGGG TGCTGTCCGG CCTGCTGCGT
CCGGCGCGTG GGAAGGTCCT GCTCGACGGC GAGGACGTCA CGCACACGAG CGCGCAGGAG
CGAGCCACCC GAGGATTGGC CAGAACCTTC CAGCACCCGG AGCTGTTCGG CAGCCTCACC
GTCCGGGACC ACCTCGTGCT GGCTCACCGG GCCAGGCACG CGAAGCGGAG GGTCTGGTCC
GACCTGTTCA CCGCCGGCAG CCTGCGCCCG GCCCGGTCGG AGGAGAACGA GAAGGTCGAC
GAACTGCTGG AGCTTCTCGG ACTCACCGAG ATCGCCCACC GCTGCGCGGT GGGGCTGCCG
CTCGGGACGG CTCGCCTGCT CGAGTTCGGC CGGGCACTGG CCAGCGACCC GACCGTCCTC
CTCCTGGACG AGCCGTCGTC GGGCCTCGAC TCGGCGGAGA CCGAGCAGAT GGAAGGCGTG
CTGCAGCGGG CCACCAGCGA GCGCGGAATC TCCGCGCTCC TGGTCGAGCA CGACGTCGAG
CTCGTGATGC GGCTGTCGAG CGCGGTCTAC GTCCTCGACT TCGGACGGCT GATCGCCAGC
GGGCCACCCG ACGAGATCCG GGCGAGCCCG GCGGTGCGGG CGGCCTATCT CGGCGAGGAA
CTGACGTCGG CGGACTCCGC GGACGACGAC GGCGCCCCGG ACAGCGAGCT CGCCGCAGGG
GCCGCTCTGT CGGCGGCCGC TGTGTCGCCG GCCGGCCGGC CTGTGCCGGC CGCGGACAAG
GGCGCCCCCG GCGAGGCGGA GGTGGCCGGG GACCCCGCTG GTCGCGTGCT GCTGACGGTC
GAGGGCCTCA CGGTGCGCTA CGGTGAGGCG CTGGCGCTCG ACGGCGTCTC CTTCACGCTG
GGCACGGGTC GGGCGCTGGC CGTGCTGGGT GTCAACGGCG CGGGCAAGAG CAGCCTCGCC
CGAGCGGTGT CGGGCCTCGT CCCGCCGACC GCCGGCCGGG TCGTGCTCGC CGGCGAGGAG
GTCACCTCCT GGCAGCCGCA TCGCATCCGG CGGGCAGCGA TGGTGCACCT GCCCGAGGGC
CGGGGCGTGT TCCGCGGGCT CAGTGTGATC GACAATCTGC GTATGGCGGC CGCGGCGGTC
GACGGCCGCC GAGCCCGGCG CGAGGCCGTG GACCTCGCCC TCGAGATCTT CCCCGTCTTC
GCCGCCCGGC GCCGACAGCT CGCCGGCCTG CTGTCGGGTG GCGAGCAGCA GATGCTCTCG
CTGGCCCGGG CGCTGGCCAC CTCGCCGCGA CTGGTGATCG CCGACGAGCT GTCGCTGGGC
CTGGCACCGA AGATGGTCGA CCTGGTCTTC GACGGGCTGG CCCAGGCGCG GCAGGCGGGC
GTGGCGGTGA TCATGATTGA GCAGTACGTG CACCGGGCGC TGGACTTCGC CGACGACTGC
CTGGTCCTGC AGCGTGGCAC GGTGGCCTGG CAGGGCTCCG CGGCCGGTGC GCGCGGTGAG
GTTCTGCGCC ACTACCTCGG CGAGGCCACC ACGGCGGCCT GA
 
Protein sequence
MTGGLTETGE TGQAGDDVAP VTGSSGPARL ECVGATVRFG GLVAVDSVDL TVPRGSIVGL 
VGPNGAGKST LFGVLSGLLR PARGKVLLDG EDVTHTSAQE RATRGLARTF QHPELFGSLT
VRDHLVLAHR ARHAKRRVWS DLFTAGSLRP ARSEENEKVD ELLELLGLTE IAHRCAVGLP
LGTARLLEFG RALASDPTVL LLDEPSSGLD SAETEQMEGV LQRATSERGI SALLVEHDVE
LVMRLSSAVY VLDFGRLIAS GPPDEIRASP AVRAAYLGEE LTSADSADDD GAPDSELAAG
AALSAAAVSP AGRPVPAADK GAPGEAEVAG DPAGRVLLTV EGLTVRYGEA LALDGVSFTL
GTGRALAVLG VNGAGKSSLA RAVSGLVPPT AGRVVLAGEE VTSWQPHRIR RAAMVHLPEG
RGVFRGLSVI DNLRMAAAAV DGRRARREAV DLALEIFPVF AARRRQLAGL LSGGEQQMLS
LARALATSPR LVIADELSLG LAPKMVDLVF DGLAQARQAG VAVIMIEQYV HRALDFADDC
LVLQRGTVAW QGSAAGARGE VLRHYLGEAT TAA