Gene Franean1_4487 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4487 
Symbol 
ID5672837 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp5353769 
End bp5355034 
Gene Length1266 bp 
Protein Length421 aa 
Translation table11 
GC content69% 
IMG OID641243354 
Productputative high-affinity branched chain amino acid ABC transporter, amino acid-binding protein 
Protein accessionYP_001508770 
Protein GI158316262 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGCGCC GAACTCGCAA TCTGGCCGTC CTGCTGGGCT TAGCCACCGC CCTGACCGCC 
GCCTGCGGCA GCGCCCCGAA GTCGGACACC GGCGGGGGGG AGACGGGCGC CGCTGACGCG
GCCGCACTCG GGCCGGTCGT CGCGGCCCCC ACCGGCACCC CGCTCGTCAT CGGCTACATC
AGCCAGGAGA ACACGGCGGT GGGGTCCTAC CCCGAGGCGC TCGCCTCGGC GCGGGCAGCC
GCTGACTACA TCAACAAGCA TCTCGGCGGA GTGCACGGGC GGCCCCTCGA ACTGTCTTCC
TGCGTCACTG ACGGGTCGGT CGCGACTTCG GCGAACTGCG CGCGGCAGAT CGCGTCCACC
TCCGGCGTGG TCGCCGCCTC AAGCAGCCTC GACTTCGGTG CCCAGGGCGC CGTACCGGTG
CTCCAGGCTG CCGGCATCCC CCGTATCGGC GGGATCGCGA TCTTCCCGGA GGAGGCGTCT
TCCCCGACCG TCTTCAACTT CGCGGGCGGT TCCTTCGCGG CCTTCCCCGC GATCGACACC
TTCGTCGCCA CCGTCCAGAA GGCCGGGCGC GTGAGCGCCC TGACATCCGA CACCTCACCC
GGCATCGCCT CGGCGAATGA CCAGATCAAG ACTCCGTTGC AGCGGGACCT CGGTTACAAG
GACGTGCCGA TCGTCGTCGC GGCTCCGGAC GCCGCCGACC TGACCGGCGC GCTGACCCAG
CTCAACGCGT CCAAGCCGGA CGCGGTGGTG AGCAGCTTCG GGCAGGCGTG CGTGCGCATC
ATGCAGGCGA AGAAGGCGCT CGCCCTGCCG TTCACGATGT ACCACACCAG TAAATGCCTC
GACGAGCGTG TGCTGCAGAG CGCGGGCGAG GCCGCCGAAG GGCACCGCTT CAACTCCGAG
ACGCGGATGT GGAACGAGAA GGACGACGAC GCGGCGATCT ACCGGGCCGC GATGGCGAAG
TACGCGTCCG GGACGACGCT GAGCAACTAC TCGACGATCG CCTTCCAGGG GATCATGAAC
ACCTACCGCC TGCTGAACAA GATGGACGAG GCGAGCCTCA CCCCGAAGGC GCTGGTCGAG
AAGATCCGCA CCACCAGCGA CGAGCCGAGC TTCCTCGGCT GGACCTACAC CTGTGACCCG
GCGAAGCTCG CCGTGGCCGG CCAGTCCGGC CTGTGCAGCA CCCTAGAGGT GATCGTCGAG
GTGAAGAACG GCGTTCCGAC CACCATCTCG GACCCGATCG ACGGCTCGAA GCTCCTGCGG
CTCTGA
 
Protein sequence
MMRRTRNLAV LLGLATALTA ACGSAPKSDT GGGETGAADA AALGPVVAAP TGTPLVIGYI 
SQENTAVGSY PEALASARAA ADYINKHLGG VHGRPLELSS CVTDGSVATS ANCARQIAST
SGVVAASSSL DFGAQGAVPV LQAAGIPRIG GIAIFPEEAS SPTVFNFAGG SFAAFPAIDT
FVATVQKAGR VSALTSDTSP GIASANDQIK TPLQRDLGYK DVPIVVAAPD AADLTGALTQ
LNASKPDAVV SSFGQACVRI MQAKKALALP FTMYHTSKCL DERVLQSAGE AAEGHRFNSE
TRMWNEKDDD AAIYRAAMAK YASGTTLSNY STIAFQGIMN TYRLLNKMDE ASLTPKALVE
KIRTTSDEPS FLGWTYTCDP AKLAVAGQSG LCSTLEVIVE VKNGVPTTIS DPIDGSKLLR
L