Gene Franean1_6824 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_6824 
Symbol 
ID5675137 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp8315369 
End bp8316574 
Gene Length1206 bp 
Protein Length401 aa 
Translation table11 
GC content69% 
IMG OID641245673 
Producthypothetical protein 
Protein accessionYP_001511064 
Protein GI158318556 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.759789 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTAAGA CTGGAGTGGT CGGACCGTTA TCCCTGGTAG CCGCGATCGT CGTGGGTGCC 
CTGACAGGGT GCTCGGCATC GGGAGCAGGA GCCGGGGCGA ACGGCCCCTG CGACAGTCCC
GGAGTCACGG CCGACCAGGT CAAGTTCGGC TTCGTCTTCT CCGACACGGG CACGGGCAGC
GAGGCGCTCT CGTCGGCCCG TCTGGGAGTT GACGCCAGGA TCGGGCTGGC CAACGAGACG
GGAGGGGTCA ACGGCCGCCG CGTCACCTAC GACTGGCGGG ACGACGCGGC CTCCTCGTCC
ACGAACGTGC GGGTGACCCA GGATCTCAGC AGTTCCACCT TCGGCCTGGT GGGAGTGACC
TCCGCTGTCG GTGACTCCCT CGACAACCTC GAGAAGGAGG GAGTTCCATA CGTCGGTCTC
GTCCAGCCCT CCTACGCCAA GTACCCGAAT GTCTTCGCGC ACCTGTACGA GGCGGCGCCG
GAGACGATCG GCCGCTACTT CCAGGCCAAC GGCGGGACGA AGGTCGCCAT GGTGAGCACC
GGGGCGTCAG CGTTCACGCA GGAGGTCGCC GGGCGGTACC GCAGCGCGTT CGAGGCCGTC
GGCCTGCAGG TGGCCGCGCT GATTCCCTTC GCGGCCAGCG TCGACAGCCC GGCGCGGGTG
GCCCAGCAGA TCGCCGGCAG CGGCGCGGAC GTGCTCATGG GCTTCACCAC CGTGGACGAC
CTGGCCGCCA TCGTGCGGGC CACCCGCCAG GCGAACCTGC GCCTCGCCGC GAGCGTCTCG
ACCAGCGGGT ACGACCGCGG CGTGCTGACC TCGCTGGGGT CGTCGCTCAG CGGGGTCTCC
TTCCCGGTCT ATTTCCGCCC CTTCGAGGCG GGCGGGCCGG CCATCGACCG CTACCGCGAC
GCGATGACCC GTTTCGCGCC GCAGGCCGTC CAGCCCGAAG AGAAGTTCGC TGTGTACGGA
TACATCTATG CGGACATGTT CCTGCGCGGA CTTGAGCTGG CCGGCGACTG CCCGACCCGC
GAGGGCTTCA TCAGCGCGCT GCGGAAAGTG ACCGACTACG ACGCCGGCGG GCTCATCGAG
CCGACCGACC TGCGCACCAA CGCGACCACC CCGCTCCAGT GCGCCGCGTT CGTCCAGGTC
AATCCGGCCG GTGACGCGTT CCAGGTGGTG CGCGAGCGAC TCTGCGCCAA CGGCCAGGGG
AACTGA
 
Protein sequence
MRKTGVVGPL SLVAAIVVGA LTGCSASGAG AGANGPCDSP GVTADQVKFG FVFSDTGTGS 
EALSSARLGV DARIGLANET GGVNGRRVTY DWRDDAASSS TNVRVTQDLS SSTFGLVGVT
SAVGDSLDNL EKEGVPYVGL VQPSYAKYPN VFAHLYEAAP ETIGRYFQAN GGTKVAMVST
GASAFTQEVA GRYRSAFEAV GLQVAALIPF AASVDSPARV AQQIAGSGAD VLMGFTTVDD
LAAIVRATRQ ANLRLAASVS TSGYDRGVLT SLGSSLSGVS FPVYFRPFEA GGPAIDRYRD
AMTRFAPQAV QPEEKFAVYG YIYADMFLRG LELAGDCPTR EGFISALRKV TDYDAGGLIE
PTDLRTNATT PLQCAAFVQV NPAGDAFQVV RERLCANGQG N