Gene Franean1_4474 details


Gene Information

Locus tag: Franean1_4474
Symbol:
ID: 5675736
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 5338138
End bp: 5339391
Gene Length: 1254 bp
Protein Length: 417 aa
Translation table: 11
GC content: 66%
IMG OID: 641243341
Product: hypothetical protein
Protein accession: YP_001508757
Protein GI: 158316249
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCATTTT GTCGTCCACG GCGGCTGAAC GCTGCCGTCA GTGTGATCGC GATCAGCGGC 
CTGACGAGTG TGGTGGCGGC CTGCGGGATC ATGGGCGGGG GAGACGGGTC CGGGCCAGCG
ACCGAGGGCT GTGCGACACC CGGTGTGACC TCGACCCAGG TCACTCTCGG AACGGCCATT
GACGACACGG GGATCGGCGC GGGTGCCCTC GCGGCGTTCC GCGCCGGGAT CGATGCCCGT
CTCGGCGTGG CGAACGACAA CGGCGGGGTC AACGGCCGGA AGGTCGTCTA TGAGTGGAAG
GACACCCAGT CCGACCCGTC GTCCACCCAG AACGTCGTGC GCGAACTGGT CGAGACCAAG
GGCGTGTTCG GCATCATCCA GGGCTCGATC ATGGCCCTCA GCTCGGCGGA CTACGTCGAG
GAGCACGGCA TCCCGGTGGT CGGGCCCAGC ATGGACGAGT CCTGGCCAGA CCACCCGAAC
ATGATCAGCT GGTTCTACGT CCAGTCCGCG AGATGGTCCG TAAGCACCTG GGGTGACTTC
GCGCGGTCCC AGGGGGCTAC CCGCGTGGCC ATCCTCGGCT TGGCCCTCAA TCCGGGGACC
TACGAGGCTC AGCTACAGGC GAGTATGGAA TCGGCCGGGA TCCCGGTCGT CCTCAACCCG
GACGTGACGG TCGGTGCCAC CAGTTTCAGC CGACTGGCGC AGGAGTTGAA GGCCGCGAAT
GTCGACACGA TCACTGGTGC GGTGACCCCC GACGTCCTGG CCCAGCTCAT GCCGGCCGTG
CGCAACGCTG GCCTCCAGCT GAAGCTGGTG ATGACACCCA CGGGCTACGA CCCTGCCCTG
CTCCAGCGAC TCGGCCCGCA GGTCGCCGGA ACGACGATCT ACGTTGATTT CGCGCCGTTC
GAACTGAACC TGCCGGCCCA CCAGACATTC ATCTCCGCCA TGGCCCGGTA CGCACCCGAG
ACCCAGCCGC CGCAACAGCA GAGTGCCGTG TGGGGATGGA TCTCGGCCGA CCTGTACCTG
CGCGGGCTGC AGGACGCCGG CGAATGCCCG ACCCGCGAGG GGTTCGTGAA CGCGTTGCGC
GCCGTCCACG ATTACGAGGC GGGCGGACTC CTGTCCAGCA AGGTCGACTT CGCGACCAAC
ATCGGGCAGC TCAGCACCTG CTACCAGTTC GTGCAGATAT CCCAGGACGG AAAGGAGTTC
ATCCCGCTCA ACCCCAGCCA GCGCTGCGGT ACCATACTGA GCCGTTTCCG GTAA
 
Protein sequence
MPFCRPRRLN AAVSVIAISG LTSVVAACGI MGGGDGSGPA TEGCATPGVT STQVTLGTAI 
DDTGIGAGAL AAFRAGIDAR LGVANDNGGV NGRKVVYEWK DTQSDPSSTQ NVVRELVETK
GVFGIIQGSI MALSSADYVE EHGIPVVGPS MDESWPDHPN MISWFYVQSA RWSVSTWGDF
ARSQGATRVA ILGLALNPGT YEAQLQASME SAGIPVVLNP DVTVGATSFS RLAQELKAAN
VDTITGAVTP DVLAQLMPAV RNAGLQLKLV MTPTGYDPAL LQRLGPQVAG TTIYVDFAPF
ELNLPAHQTF ISAMARYAPE TQPPQQQSAV WGWISADLYL RGLQDAGECP TREGFVNALR
AVHDYEAGGL LSSKVDFATN IGQLSTCYQF VQISQDGKEF IPLNPSQRCG TILSRFR
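
Consistency check (illustrative)

The coordinate and length fields in this record are related by simple arithmetic: the gene length follows from the inclusive start and end coordinates, and the protein length is the number of codons minus the terminal stop codon. The sketch below shows one way to check this, assuming 1-based inclusive coordinates. The helper names gc_percent and expected_protein_length are hypothetical and not part of any database tooling; gc_percent can be applied to the gene sequence above, with the spaces and line breaks left in, to compare against the listed GC content.

```python
def gc_percent(seq: str) -> float:
    """Return the G+C percentage of a nucleotide sequence.

    Whitespace (the 10-base grouping and line breaks used in the
    record) is ignored.
    """
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS minus one for the stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values copied from the record; coordinates are assumed to be
# 1-based and inclusive.
start_bp, end_bp = 5338138, 5339391
gene_length = end_bp - start_bp + 1

print(gene_length)                           # 1254
print(expected_protein_length(gene_length))  # 417
```

Both printed values agree with the Gene Length and Protein Length fields listed under Gene Information.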