Gene Franean1_2700 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_2700 
Symbol 
ID5671091 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp3194321 
End bp3195517 
Gene Length1197 bp 
Protein Length398 aa 
Translation table11 
GC content63% 
IMG OID641241612 
Producthypothetical protein 
Protein accessionYP_001507032 
Protein GI158314524 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.75707 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGACCCC TGGGGGTCGT CGCGGCAGCA CTGGTAGTTT CTGCCGCTGC GGTCGGCTGC 
TCGGCAGGTG AGTCCGAGAA CAACCAGGCA GGTGCCTGCG AAAGCCCTGG TGTGACATCC
GACCAGGTGA AGATCGGATT CGTCGTCTCG GACTCCGGTC CCGGCCATGA AGCGCTGTCC
TCGGCACGTG CAGGTGTGGA GGCCAGAATA GGTCTCGTCA ATGCGGCCGG CGGAATAAAT
GGCCGTGAGA TCACCTATGA TTGGCGTGAT GACGGGAGCT CCGCCTCGGT AAACGTGGTG
GCGACCGAGG AGCTTGTGCG GAGTGAGCCG GTTTTCGGTC TTCTGGCGGT GACGACTGCG
CTCGACCCGT CGATGGATGC CCTGGAGGCG GCGGGGATTC CGGTGGTGGG GCTGGCGGCC
AGTCCCGGCT GGGCTAAGCA TCGGAACATG TTCTCGTACT CGTACGAGGC TTCGCCCGTG
ACGATCGGTC GCTATATTCA GTCTCTCGGC GGAAGAAAGG TGGCAGTGCT GGTCGCGGGA
GCCATTGACT CTGTACCGGA GACCGTGGCG AAATATATTG CGGAGATGCG CGCGGCCGGC
GTCAACGTCG TCGGCTCGAT TTCTTATGCG AGTGCGGCCG AGAGTCCATC CCGGGTCGTG
CAAAGGATTG CCAGTCTTGG AGCTGATTCC ATAGTTGGCT TCACCACGGC GGAAGAATTT
GCGGAAATCA TACAAGCCGC TCGCTCCGCC GAGCTGCGCA TCGTGGCCAG CGTTGCCCAG
GCCGTGTATG ACCGTGCGTT GCTGCCCACG TTCGGGCCGG CGCTTGCAGG AGTTTCGGCT
CCCGTGTACT TCCGCCCTTT CGAAGCAGGT GGCGAGGCCA TGGACCGCTA CCGCGATGCG
ATGGCCCGGT TCGCCCCGGA GACCGGGGTT CCCGAGCGGC AGTTTGCGTT GCTTGCATAC
GTCTATACTG ACTTGTTCAT CCAGGGCCTT GAGCTGGCTG GCGCCTGCCC CACCCGCGCA
GCCTTCATCG AGGGTCTGCG AAAAGTCACC TCCTATGATG CCGGCGGTCT GATCGAGCCG
GTGAGCCTCA GGGACAACCT CGGCGTGCAG CTCAGCTGTT ATGCGTTCGT ACAGGTCAGT
CCGGCGGGTA ATGAGTTCCA GGTCGTCCAG GAGCGCGTGT GCGCTGACGG TAAATGA
 
Protein sequence
MRPLGVVAAA LVVSAAAVGC SAGESENNQA GACESPGVTS DQVKIGFVVS DSGPGHEALS 
SARAGVEARI GLVNAAGGIN GREITYDWRD DGSSASVNVV ATEELVRSEP VFGLLAVTTA
LDPSMDALEA AGIPVVGLAA SPGWAKHRNM FSYSYEASPV TIGRYIQSLG GRKVAVLVAG
AIDSVPETVA KYIAEMRAAG VNVVGSISYA SAAESPSRVV QRIASLGADS IVGFTTAEEF
AEIIQAARSA ELRIVASVAQ AVYDRALLPT FGPALAGVSA PVYFRPFEAG GEAMDRYRDA
MARFAPETGV PERQFALLAY VYTDLFIQGL ELAGACPTRA AFIEGLRKVT SYDAGGLIEP
VSLRDNLGVQ LSCYAFVQVS PAGNEFQVVQ ERVCADGK