Gene Franean1_5556 details


Gene Information

Locus tag: Franean1_5556
Symbol:
ID: 5673886
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 6731362
End bp: 6732930
Gene Length: 1569 bp
Protein Length: 522 aa
Translation table: 11
GC content: 66%
IMG OID: 641244412
Product: extracellular solute-binding protein
Protein accession: YP_001509816
Protein GI: 158317308
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID:
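
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes one stop codon; the function name check_cds_lengths is hypothetical and not part of the source database.

```python
def check_cds_lengths(start_bp, end_bp, gene_length_bp, protein_length_aa):
    """Return True if the coordinates and lengths of a CDS agree.

    Assumptions (not stated in the record itself):
      - start_bp and end_bp are 1-based, inclusive coordinates
      - the gene length counts one terminal stop codon
    """
    # Inclusive span on the replicon
    span_bp = end_bp - start_bp + 1
    if span_bp != gene_length_bp:
        return False
    # A CDS should be a whole number of codons
    if gene_length_bp % 3 != 0:
        return False
    # All codons except the stop codon encode an amino acid
    coding_codons = gene_length_bp // 3 - 1
    return coding_codons == protein_length_aa


# Values copied from the Gene Information fields above
consistent = check_cds_lengths(6731362, 6732930, 1569, 522)
print(consistent)
```

For this record the span is 6732930 - 6731362 + 1 = 1569 bp, which is 523 codons, or 522 amino acids once the stop codon is excluded.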


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.662967
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGCGTA AGAAGCGATT ACTCGGCGCG GCCGTGGTCT GCGTCGCTAC CCTTGTCCTG 
GGAGCCTGCG GGGGCGGTGA TTCTGACACG GCCAGCCCGG CCAGCGATGC CTCCAGCAAA
CCCGTTTCGG GTGGCGTCGC GCGAATCATC ATGACGAGTG ACCCGACCAG CCTGGACCCG
GCGTCGCTGG CCAATCAGGC ACCGATCACG GCGGTGCTGG GCAACGCCCT GTACGGCACG
TTGCTGACCA CGGACGAGAC CAGCAAGGTC GGCTACTCGA TGGCCGAGTC CTTCACCACG
ACCGACGGCG GCGCCACCTT CGAGCTGAAG CTGCGCCCTG ACCTGGTCTT CTCCGACGGT
ACGCCCCTCA ACGCCGCGGC TGTGAAGTTC AACTGGGACC GCATCAAGGA CCCGGCCACC
GCGTCGTCGA GCCTGCCGGA AGCGGCAATG GTCGCCTCGA CCGAAGTGAT CGACGACCGC
ACGATGAAGG TCACGATGAC CACGCCCGTC GCGGCGTTCG CGCAGGCGGT TGTCGGCACG
GTTCTGAACT GGGTGGCCTC GCCGGCAGCG CTGCAGAAGG GCAAGCAGTC CTTCGACGAG
AAGCCGATCG GCGCGGGACC CTTCACCCTG CAGAGCTGGA CCCGTCAGGC CGAGATCAAG
CTCACGAAGA ACCCACGCTA CTGGGACGCG CCCAAGCCCT ATCTCGACGG GATCACGCTG
CGCACGGTGC TCGACTCCAA CCAGCGTTAC AACACTCTGA CCAGCGGCGG CGCTGATGTT
TCCATCGAAA CAAACTGGAT CAACCTCGGG AAAGCTGAGG CCGCTGGCCT CCCGTCCGAC
CTGCTGCCGC TCAGTGGTGG AAACTTCCTG GCGCTCAACA CGCGCCGAGC CCCGTTCAAC
GATATCCGGG CTCGGCAGGC CGTGTCCGCG GCGCTGGACA TCGACGCGCT GAACCTGGCC
GCCTACAACG GGAAGGGCAG TGTCGCGGAC ACGCTGTTCA CCGATGCCTC ACCCTTCTAC
TCGAAGACGC AGCTGAGGTC CACCGACCGG GCGAAGGCAC AGCAGCTCTT CGACGAGCTG
GCGGCCGAGG GAAAGCCGGT GTCGTTCACC TTCTCCAGCT ATCCGACCAG TGAGAACAAG
GCGATCGCGG AGAACGTCCA GGCCCAGCTC AGCAGCTTCA AGAACGTCAA GGCCGAGGTT
GCGATCATCG ATTTCGCGAA GGGTGCCGCG CTGCGCTCGA CCCACGACTT CGACATGGTC
ATCTCGTCGG CGGCCTTCCA GGACCCCGAG CCGCGGCTGC TGGCGAACTT CACCGGGAAC
TCACCGGCCA ACATGTCCGG TCTCGCGGAC CCGGAACTGG ATGCGGCCCT GCTGGCCGGC
CGGACCGCGA CCTCGGTGGC CGATCGTAAG GCGGCCTACG ACAAGGTACA GGCGCGACTG
ACAGCGCTGA CGCCGGTCAT CTTCCTCATG CGATCGGCCC CCGGCGCGAT CGCGGCCAAG
AACGTCAACG GCCTCAGGCA GTACGGCGCC GGCTCCCTGC TGCCCGAGGA GCTGTGGATC
GAGAAGTAG
 
Protein sequence
MARKKRLLGA AVVCVATLVL GACGGGDSDT ASPASDASSK PVSGGVARII MTSDPTSLDP 
ASLANQAPIT AVLGNALYGT LLTTDETSKV GYSMAESFTT TDGGATFELK LRPDLVFSDG
TPLNAAAVKF NWDRIKDPAT ASSSLPEAAM VASTEVIDDR TMKVTMTTPV AAFAQAVVGT
VLNWVASPAA LQKGKQSFDE KPIGAGPFTL QSWTRQAEIK LTKNPRYWDA PKPYLDGITL
RTVLDSNQRY NTLTSGGADV SIETNWINLG KAEAAGLPSD LLPLSGGNFL ALNTRRAPFN
DIRARQAVSA ALDIDALNLA AYNGKGSVAD TLFTDASPFY SKTQLRSTDR AKAQQLFDEL
AAEGKPVSFT FSSYPTSENK AIAENVQAQL SSFKNVKAEV AIIDFAKGAA LRSTHDFDMV
ISSAAFQDPE PRLLANFTGN SPANMSGLAD PELDAALLAG RTATSVADRK AAYDKVQARL
TALTPVIFLM RSAPGAIAAK NVNGLRQYGA GSLLPEELWI EK
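
The gene and protein sequences above are printed in blocks of 10 characters. As an illustration of how the two relate, here is a minimal sketch that strips the block spacing, computes GC content, and translates the DNA codon by codon. The helper names (clean_sequence, gc_content, translate) are hypothetical. The codon table is the standard one, which translation table 11 shares for codon-to-amino-acid assignments; the sketch does not handle the alternative start codons that table 11 permits.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code, shared by table 11)
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text):
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_content(dna):
    """Return the percentage of G and C bases in a cleaned DNA string."""
    if not dna:
        return 0.0
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna):
    """Translate complete codons in order, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above
first_blocks = clean_sequence("ATGGCGCGTA AGAAGCGATT ACTCGGCGCG")
print(translate(first_blocks))
```

Translating the first 30 bases gives MARKKRLLGA, which matches the first block of the protein sequence. Pasting the full gene sequence into clean_sequence would allow the same comparison for the whole protein and for the reported GC content.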