Gene Franean1_1827 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_1827 
Symbol 
ID5670229 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp2192134 
End bp2193333 
Gene Length1200 bp 
Protein Length399 aa 
Translation table11 
GC content77% 
IMG OID641240748 
Producthypothetical protein 
Protein accessionYP_001506171 
Protein GI158313663 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.197426 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTGAGA ACCCGGCGGG TTTCGGGCAG CCGGACGCTG ACCCCGACGA GATCCCCACC 
GAGCCGACTT CACCCCCCGC GGCGGGCGCG GCAGCGGCGG CGGATCAAGC GGCTGCCGGT
GGTTCGGGCT CCGCGGGAGC CCCCGGCTCA GGCCGTCCGG TGGAGTCGCC GAACGCGGAA
GCGGGAGCTG CCGGCTCTGG CGAGGGACGA GAGGACGGCA CGGCACCGGA TGCCGGTTCG
CCGCAGGCGG CTGGTTCCCC CGATACGGCT GGTTCCCCGG AGGCGACTGG TCCGGACACG
GCTGGTTCCC CGGATCGGGG GGCCACGCGC GCCGACACCG CCTCACCGAG CCTGACCGAC
AGCGGGGCTG ACGGGGCGGG CCGGCGGACC CCCCGGGAGC GCCGCTCACC GGCCGCCGAC
GAGGCGGCCT TCCTCGAGCT GATCGCCCGG TTCGACCAGG AGCCGCCGCC GGGCGAGCGG
CTGTGGCCGG CCGCCGAGGA CGTCGACGAG CCCCGACGGC CGTCCGTCAT CATCATCAGG
CCCGCGGGTC GGCAGCCCGA CGGCATGCCG CCGGACGGGC GCCACCTCGC CGGCGACGAC
ACCGACCCGG GCACCGAGCG CACCGAGCCG CCGTCCCCGC TCGGCCGCGG TACGGACGGC
GCCGCCCCGG TGGACCTCGA CGGGACGGGC GCGGACCACC CCAACGCCGA CCTGACCGGT
GCCGACCCGG CCGGCACCGA TCACACGGGC GCCGGGAAAG CAGGCACGGG CCGCCAGGGC
GGCGGCCGGG GCGAGGCCGG GGCCGACGAC GAAGCGGACC GCGCGGGAAA CAGCCGCGGG
CGGGACAACC GCTCCCCGCT CGACGGCATC GCGGGGCTCG ACGCCGCCGT GCGGGCCGCC
TTCGGCACCG CGGGTCGCGA CGCTCCCGAG TATCCGGGCG CGGACGACGA CGATCACTAC
GTGCCGCCGC CACCACCGCC GGTCCCGAAG CTGCGGCCGG TCACCCGCTG GGCGCTGGGC
TCCATCGCGC TCGGCGTCGC GATCCTGGTG GTCCCGACCC TGATCGGGCT CAACCATTCA
CGTTCCCAGG ACGTCGCGGG CGTCCTGCTC ATTCTCGGAG GCGTCGGGAC GCTCGTGGCC
CGCATGGGCG ACCGGCCGCC GACGGACTTC GACGGACCGG ACGACGGCGC GGTCGTCTGA
 
Protein sequence
MPENPAGFGQ PDADPDEIPT EPTSPPAAGA AAAADQAAAG GSGSAGAPGS GRPVESPNAE 
AGAAGSGEGR EDGTAPDAGS PQAAGSPDTA GSPEATGPDT AGSPDRGATR ADTASPSLTD
SGADGAGRRT PRERRSPAAD EAAFLELIAR FDQEPPPGER LWPAAEDVDE PRRPSVIIIR
PAGRQPDGMP PDGRHLAGDD TDPGTERTEP PSPLGRGTDG AAPVDLDGTG ADHPNADLTG
ADPAGTDHTG AGKAGTGRQG GGRGEAGADD EADRAGNSRG RDNRSPLDGI AGLDAAVRAA
FGTAGRDAPE YPGADDDDHY VPPPPPPVPK LRPVTRWALG SIALGVAILV VPTLIGLNHS
RSQDVAGVLL ILGGVGTLVA RMGDRPPTDF DGPDDGAVV