Gene Franean1_7028 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_7028 
Symbol 
ID5675339 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp8572164 
End bp8573423 
Gene Length1260 bp 
Protein Length419 aa 
Translation table11 
GC content62% 
IMG OID641245874 
Producthypothetical protein 
Protein accessionYP_001511265 
Protein GI158318757 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.557944 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGAAAA CCTTAAAAAG AAGGCGGCCA TTCTTTATGG TCTCCGCCGT ACTTTGCGGC 
GCGCTAATTC TCGCTGGTTG TAGTCCAGTC GGCGAGCAAC CGTCCGTATC AACAGCCAGC
CAGGCGTGCA ATACGCCCGG AATCACCGCA GGCGAGGTCC GCCTTGGGTT CTTGATGTCC
GAGAGTGGCG CGGGAGCATC GATGGCCCAG CCGTTCAGGG CGGGCGTCGA CGCCCGCCTG
GGCGTGGCGA ACGCGGCCGG AGGGGTCCAC GGACGCAAGG TTACCTACGT GTGGCGAGAC
GACGAGTCGG CATCGGCTGT CAACCTTGCT TCGGCACGAC AGCTCCTCGC CACGGACGAC
GTATTCGGAA TGATCGAGGC CAGCGCGGAG GCGTTCGGTT CATCAGCACT TCTACACAGC
TCCGGGATCC CAGTCGTGGG TATCGCGCTG GATCCCACCT GGGCCTCCAA TGACAACATG
TTCAGCTTTA CGAACATGAT GGCAAATAAT TCTTCCATCA GCACCTGGGG TGATTTTGTT
GCAGCCCAGG GCGGCCGTCG CGCATTGATA TTCAAGCCCA TCTTCTCCGC GGCTTCAGAC
ATCCTTGCGA TGAAAATGTC CGACAGTCTG CAGGCGGCTG GTGTAGCCGT TGTCGGCAAT
AACGAGATCA GCCCGATGAC GCTCGTTCCC GCCGTCATCG GCGAGCAGAT CAGAGCCACA
GCAGCCGACA CCCTGATATT TGCCACAGAC GCTGAGAACT CCTATCGGAT CGTGGCAGCG
GCCCGGGCAG CCGGTGCGGC AATCAGGGTC GCTCTGGTTC CGCCGGACGG CTACGACCCC
CGAGCGCTCC ACGAATGGGG AAGCGCCATC GCGGGCACGT ACTCCTATCT TCCGATCACA
CCGTTTGAGG TGAGCACCCC CGTCTACCGC GGGTTCTTCA ACGCCATGGC CGCCTACTCG
GCCCAGTTGC AGCCGCCGAA TCAAACCTAC GCGGCGGAGG GCTGGATCGC CGCCGACATG
TTCCTGCGCG GGCTGGCCAT GGCGGGGGGC TGCCCGACCC GCGCAGAGTT CATCAGCAGC
CTGCGGTCCG TTCAGGCCTA CGACGCCGAA GGACTACTGC CCGCGTCGTT GAACATCAGC
ACGAGCGTTG GCGAGATCAT CCGCTGCCTT CACTTCGTGC AGGTCGCGCC CGACGGGACC
CATTTCACGC AGGTAACCCC AACACCGTTG TGTGGCAGGC GGCTGGCCAC AAACGACTGA
 
Protein sequence
MMKTLKRRRP FFMVSAVLCG ALILAGCSPV GEQPSVSTAS QACNTPGITA GEVRLGFLMS 
ESGAGASMAQ PFRAGVDARL GVANAAGGVH GRKVTYVWRD DESASAVNLA SARQLLATDD
VFGMIEASAE AFGSSALLHS SGIPVVGIAL DPTWASNDNM FSFTNMMANN SSISTWGDFV
AAQGGRRALI FKPIFSAASD ILAMKMSDSL QAAGVAVVGN NEISPMTLVP AVIGEQIRAT
AADTLIFATD AENSYRIVAA ARAAGAAIRV ALVPPDGYDP RALHEWGSAI AGTYSYLPIT
PFEVSTPVYR GFFNAMAAYS AQLQPPNQTY AAEGWIAADM FLRGLAMAGG CPTRAEFISS
LRSVQAYDAE GLLPASLNIS TSVGEIIRCL HFVQVAPDGT HFTQVTPTPL CGRRLATND