Gene Franean1_3920 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3920 
Symbol 
ID5672281 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp4687062 
End bp4688633 
Gene Length1572 bp 
Protein Length523 aa 
Translation table11 
GC content67% 
IMG OID641242799 
Productextracellular solute-binding protein 
Protein accessionYP_001508216 
Protein GI158315708 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.407855 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0244216 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTCCGTA AGTCCCGGTT AGTCGCCACC GCTGTCGTGT GCGGCGCGGC ACTCGCCCTG 
GGGGCCTGTG GTGGTGGAGG CGGTGACGAC GCCACCTCGT CGACGAACGG CGCCGCGGGC
CAACCGGTCG CCGGCGGCGA GGGCCGGATC CTGCTGCTGG GCGACCCGCG CAGCCTGGAC
CCGGCGACCC TCAGCAACCA GGCCGCGATC ACCGCCCCGG TCGGCAACGC GCTGTACGGC
ACGCTGATGA TCACCGACCA GGCCGGCAAG GTCAAGTACA CGATGGCCGA GTCGTTCGAC
ACGACCGACA CCGGCAAGAC CTTCACCCTC AAGCTGAAGC ACGGCCTGGT GTTCTCCGAC
GGCAAGCCGC TCAACGCGGA GGCCGTCAAG TTCAACTGGG ACCGCATCAA GGACCCGACC
GTGGGCTCCT CCTACGTCGT GGACGCGCGG ATGATCGAGT CGACCGAGGT GGTCGACGAC
GTGACACTGA AGGTCACGAT GGTCAACCCC GTGCCGGCGT ACGCCCAGGC CGTCCTGAAC
TCGTCACTGA ACTGGATCGC CTCGCCCGAC GCTCTGAAGG CCGGACGGGA CTCCTTCGAC
AAGAACCCGA TCGGCGCCGG GCCGTTCACC CTGGCGAGCT GGACCCGCCA GGCGGACATC
AAGTTCGTCA AGAACCCCAA GTACTGGGAC GCGCCCAAGC CCTACCTGGA CCGCCTCACC
ATGCGCTCGG CGACCGACGC CACCCAGCGC CTCAACACGG TGATCAGCGG TGGCGCCGAC
GTCGCGATCG ACACGAACAC GGTCAACATC GACAAGGCTG AGACGTCCGA TCTCAACGCG
GTCGTGACCA CCCTCAACGG CGGCAACTTC ATGGCGTTCA ACTCGCGCCG GGCGCCGTTC
GACGACATCC GTGCCCGCCA GGCGGTGTCG GCGGCGATCG ACCTCGAGGC GCTCAACCTC
GCCGCCTACA ACGGCACCGC TCCCCTGCCC GACACGCTGT TCGACAAGAG CTCACCTCTC
TTCTCGGACA CGCCGCTACA CAAGACGGAC AAGGCTCTCG CCCAGAAGCT CCTCGACGAG
CTCGCCGCCG ACGGCAAGCC GGTGAAGTTC ACCTTCTCCA GCTTCCCGTC CTCGGAGAAC
CGGGCGATCG CGGAGAACAT CCAGGCCCAG CTGAGCGCCT TCAAGAACAT CACGGTCTCC
GTCAAGATCG TCGACCTCGG CCAGGTCGCG GCGCTGCGCA CGACCTTCGA CTTCGACCTG
CTCGTCTCGT CGGCGTCGTT CCAGGACCCG GAGCCGCGGC TGTGGCAGGC GTTCAGCCAG
GACTCCGTGG CGAACCTGTC CGGTGTCAAG GACAAGGAGC TCTCGGACGC GCTGCTCGCG
GGTCGCACCG CGACGACGGA GGCGGACCGC AAGGCCGCCT ACGAGACGGT GCAGGAGCGG
CTGGTCGCGC TCAGCCCGGT CGTGTTCTAC CAGCGGTCGA CGAACGCGGC GATCGGCACC
GCCAAGGTCG GCGGGATCGT CCAGTACGGC AGCGGCTCGC TGCTGGTCGA GGAACTCTGG
ATCAAGAAGT AG
 
Protein sequence
MFRKSRLVAT AVVCGAALAL GACGGGGGDD ATSSTNGAAG QPVAGGEGRI LLLGDPRSLD 
PATLSNQAAI TAPVGNALYG TLMITDQAGK VKYTMAESFD TTDTGKTFTL KLKHGLVFSD
GKPLNAEAVK FNWDRIKDPT VGSSYVVDAR MIESTEVVDD VTLKVTMVNP VPAYAQAVLN
SSLNWIASPD ALKAGRDSFD KNPIGAGPFT LASWTRQADI KFVKNPKYWD APKPYLDRLT
MRSATDATQR LNTVISGGAD VAIDTNTVNI DKAETSDLNA VVTTLNGGNF MAFNSRRAPF
DDIRARQAVS AAIDLEALNL AAYNGTAPLP DTLFDKSSPL FSDTPLHKTD KALAQKLLDE
LAADGKPVKF TFSSFPSSEN RAIAENIQAQ LSAFKNITVS VKIVDLGQVA ALRTTFDFDL
LVSSASFQDP EPRLWQAFSQ DSVANLSGVK DKELSDALLA GRTATTEADR KAAYETVQER
LVALSPVVFY QRSTNAAIGT AKVGGIVQYG SGSLLVEELW IKK