Gene Franean1_3247 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3247 
Symbol 
ID5675712 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp3837088 
End bp3838395 
Gene Length1308 bp 
Protein Length435 aa 
Translation table11 
GC content67% 
IMG OID641242139 
Producthypothetical protein 
Protein accessionYP_001507559 
Protein GI158315051 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGTTAC GGCGCGTCAG ACGCGGTGCG GGCCTGCTGG CCGCCGCCAT CGCGGTCACA 
CTCGCCGGCT GCAGCGTCGA GTCGAAAGAC GGCCCGGAAA CGACTGCACC GACCTCGAAC
GCCTACCCGG CCCCACCGGC GCCGGCCATC TCACCCGGGG TGACCGACAA CACCATTAAA
ATCGGATTTG TCTACCCGGA TCTCGCCGGG GTCAAGCAGT ACATTCACGT CGACCACGGC
GACTATGCGG CAACATTCAC CGCGCTCGCC GACAAGATCA ATGCGGCGGG CGGCATAAAC
GGCCGCAAGA TCATTCCAGT CTTCGGCGGC GTCAACGTCC TGTCCGCCGC CGGCGCCACC
GAGACCTGCG TCCGGCTCAC CCAGGACGAG AAGGTCTTCG CCGTACTCGG CACTCTCAAC
GCCGACGACT CGCTCTGCTA CGTCCAGACC CACAAGACCG CCCTCGTCGG CGGCGACCTC
ACCACCGCCC GGTACGCCAA GGCCCAGGCG CCCTGGTTCT CGACCCTGCG GGGCGGCGAC
GAGGTGGCCG ACGGCATCGC CGCGTTCGGC GCCGGCGGCG CCCTCGACGG CAAGAAGGTC
GCCGTCGTCA GTAACCAGCT GGAGCAGCAG ACGGCCCGGG ACATCGTCCT GCCGGCGCTG
GACCGGCTCG GGGTCACGCC TGTCGAGAAC GCCACGGTCG AGATCGACGC CGCCGACCAG
GCCGCGAGCA ATCAGCGCAT CAACGTCGTC ATCCAGAAGT TCCAGGCCGT CGGCGCTGAC
ACCGTGATCG TGGTCGGCGG TCTCGCCGGT AACTTCCCGG GCCTGCTCGC CGACACGACG
TACCGCCCGA AGCTGCTGTT CACCTCCAAC AACTCGGCAA CTGCTTTCGT GCGCAGTGAC
GACAACGCCG GCAAGCTCGG CGCGCTGCCG GGATCCACCG GTATCGGGCT GGTCACCGAC
TACAACGAGA AAAGCTTTCT CGACTGCACC CGAACCGTGG CAGCCGCGAT TCCCGAGACC
GCGGGCAAGT TCCAGGCACC CGCGGACGTC CAGTCGGGCC AGCCGACCTA CGCCGTCTCG
GCCAGCGTTG CCTGCAGCAC CCTGAGCCTC TTCCAGGCCA TCGCGGAGAA GGCCGGTAGA
GACCTTACAT ACGCGACCTT CCAGAAGGCC GGGTTCTCCC TCGGCAGCTT CAGGGTCCCC
GCGTCCCTGG ACCCCGCGAC CTTCGGCCCG CAGACCCCGC ACGGCGCGAT CAAGCCCCAT
CTCTACACCT ACGACACCAG CAGCAAGAAC TTCGTCGTCA AGGACTGA
 
Protein sequence
MKLRRVRRGA GLLAAAIAVT LAGCSVESKD GPETTAPTSN AYPAPPAPAI SPGVTDNTIK 
IGFVYPDLAG VKQYIHVDHG DYAATFTALA DKINAAGGIN GRKIIPVFGG VNVLSAAGAT
ETCVRLTQDE KVFAVLGTLN ADDSLCYVQT HKTALVGGDL TTARYAKAQA PWFSTLRGGD
EVADGIAAFG AGGALDGKKV AVVSNQLEQQ TARDIVLPAL DRLGVTPVEN ATVEIDAADQ
AASNQRINVV IQKFQAVGAD TVIVVGGLAG NFPGLLADTT YRPKLLFTSN NSATAFVRSD
DNAGKLGALP GSTGIGLVTD YNEKSFLDCT RTVAAAIPET AGKFQAPADV QSGQPTYAVS
ASVACSTLSL FQAIAEKAGR DLTYATFQKA GFSLGSFRVP ASLDPATFGP QTPHGAIKPH
LYTYDTSSKN FVVKD