Gene Franean1_3676 details

Gene Information

Locus tag: Franean1_3676
Symbol:
ID: 5672042
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 4351562
End bp: 4352893
Gene Length: 1332 bp
Protein Length: 443 aa
Translation table: 11
GC content: 67%
IMG OID: 641242559
Product: ABC-type branched-chain amino acid transport systems periplasmic component-like protein
Protein accession: YP_001507979
Protein GI: 158315471
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component
TIGRFAM ID:
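
The Start bp, End bp, Gene Length and Protein Length fields can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends with a single stop codon that is not counted in the protein length. The variable names are illustrative and not part of the database record.

```python
# Values copied from the Gene Information fields.
start_bp = 4351562
end_bp = 4352893

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and yields no amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")
print(protein_length, "aa")
```

Under these assumptions the computed values agree with the listed Gene Length (1332 bp) and Protein Length (443 aa).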


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.36982
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAACGTC ATGGCCGCCG CCTGCGCTGC CTGGCCATCC TGCTCGCCGT CGGCTGTCTG 
GTCGCCGCCT GCTCGAACTC CTCCGACTCG AACACCGCCG CCCCCGCCTC GTCCGCGGGT
GGCGGCGCGG TACCGGGTGT CACGAACGCC GAGATCCGGT TCTCCGTCCT GGGGACGCGA
ACCAACAACC CGCTGGGCAC CTGCCTGCTG GACTGCTTCT CCCAGGGGGT CAAGGCGTAT
TTCGACTACC GGAACAGCGA GGGTGGCGTC CACGGCCGCA AGCTCGTCGT CGCCCAGGAG
CTCGATGACG CGCTCGGGCA GAACCAGCAG AAGGCGATCG AGATCGTGTC GGCGAAGGAC
ACCTTCGCGA CCTTCAGCGC CCCGCAGCTC GCCAGCGGAT GGCAGAACTT CGCGGACGCG
GGGATCCCGT TGTATGGATG GGACATCCAC CCGGCGCAGA TGATCGGCCG GAAGAGCATC
TTCGGTAACG CGGCCCCGCC CTGCCTGGAC TGCATGGACC GCACTTTCAC CTACGTCGCC
GAACTGGCCG GCACCAAGCG GATCGCGGCG CTCGGGTACG GCGTGTCCGA CAACTCGAAG
CAGTGCGTCA CCCAGATCAC CGACACGATC GAGCGGTACG GTGCGGCCAC CGACCAGGAG
ATCGTCTACA AGAATGACAC CCTGGCGTTC GGCCTGAGTA ACGGGGTCGG TCCGGAGGTG
GCCGCGATGA AGCGCGCGGA CGCGCAGCTG GTCATCACCT GTCTCGATCT GAACGGCATG
AAGACCCTGG CCCAGGAACT CGAGCGGCAG GGCATGGGCG ACGTCCCGAT GTACCACCTG
AACACCTACA ACCAGGAGTT CGTCGAGGAG GCCGGCGCGC TTTTCGAGGG TGACTACGTC
AGTGTCGGGT TCCGGCCGTT CGAGGCCGAT CCCGCGGACA CCGCCATGGC CACGTTCAAG
AAGTGGATCG GCAAGGTGGG CGGCCGGCCC GACGAGATGG CGATGTACGG ATGGATCAAC
GCGGACCTCG CCTACCAGGG CATCCTCAAG GCGGGGCCGG CGTTCACCCA GGCGTCCGTG
ATCGACGCCA CCAACCACCT GACCGACTAC ACCGCGGATG GCCTGACCGT TCCGGTGGAC
TGGTCGCGTC AGCACAACCA GCGTTCCAAG GCCGACCCGG TCACCAACGG CTACAAGCTC
GACTGCCGGG CGCTCGTCCG CGTGCGGGGC GGGAAGTTCG AGATCGTCGG TGGAACGAAG
GACAAGCCGT TCGTCTGCTG GCCGCCGAAG GACACCACCT GGTCCGAGCC GAAGCCGACG
AGCTTCGGCT GA
 
Protein sequence
MKRHGRRLRC LAILLAVGCL VAACSNSSDS NTAAPASSAG GGAVPGVTNA EIRFSVLGTR 
TNNPLGTCLL DCFSQGVKAY FDYRNSEGGV HGRKLVVAQE LDDALGQNQQ KAIEIVSAKD
TFATFSAPQL ASGWQNFADA GIPLYGWDIH PAQMIGRKSI FGNAAPPCLD CMDRTFTYVA
ELAGTKRIAA LGYGVSDNSK QCVTQITDTI ERYGAATDQE IVYKNDTLAF GLSNGVGPEV
AAMKRADAQL VITCLDLNGM KTLAQELERQ GMGDVPMYHL NTYNQEFVEE AGALFEGDYV
SVGFRPFEAD PADTAMATFK KWIGKVGGRP DEMAMYGWIN ADLAYQGILK AGPAFTQASV
IDATNHLTDY TADGLTVPVD WSRQHNQRSK ADPVTNGYKL DCRALVRVRG GKFEIVGGTK
DKPFVCWPPK DTTWSEPKPT SFG
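
The protein sequence can be compared against a translation of the gene sequence. Below is a minimal sketch, assuming the amino-acid assignments of translation table 11, which match the standard genetic code (the two tables differ only in permitted start codons). The helper names `clean`, `translate` and `gc_content` are hypothetical, and only the first and last rows of the gene sequence are used for illustration.

```python
# Standard codon table in NCBI order (bases T, C, A, G); table 11 uses
# the same amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks between 10-base blocks."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate in frame 1, stopping at the first stop codon.

    Raises KeyError on characters other than A, C, G, T.
    """
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_content(seq):
    """Return the fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First and last rows of the gene sequence, copied from the record.
first_row = "ATGAAACGTC ATGGCCGCCG CCTGCGCTGC CTGGCCATCC TGCTCGCCGT CGGCTGTCTG"
last_row = "AGCTTCGGCT GA"

print(translate(first_row))
print(translate(last_row))
```

Each full row holds 60 bases, so every row starts in frame. The first row translates to the first 20 residues of the listed protein, and the last row translates to its final three residues followed by the TGA stop codon. Passing the complete gene sequence to `gc_content` gives a value that can be compared with the GC content field.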