Gene Franean1_4896 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4896 
Symbol 
ID5673236 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp5876429 
End bp5878345 
Gene Length1917 bp 
Protein Length638 aa 
Translation table11 
GC content74% 
IMG OID641243751 
Producthypothetical protein 
Protein accessionYP_001509167 
Protein GI158316659 
COG category[S] Function unknown 
COG ID[COG5305] Predicted membrane protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.303518 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0951056 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTGAGCG CAGACGACCT GCCGACCGGC CAGGTCACGG GCACTACCGC GGAGCCGGCA 
GCCGCGGAGC CGGCAGCCGC GGCTGTGGCG CCTCGGCCGC GGTCGGCGCA GGTCCCACCG
AACGACCATG CGGTGCTGAC CGGCCGGCCC CCGCGCACCG AGCTGCCCGC GGAGACCCAG
CCGGACGCGG AGACCCAGGC GGACGCGCAG ACCATGACCG CTGCGGAGAC CGAGCCGAGC
GCGGAAGCGC CGCCCGCCGC GGAGGCGGAG CCGGGTGAGC AGACCGTGCC CGCCCAGCGG
ACCGATTACC CATCGCGGGC CGAGCCGGCG CGGCTGATCG CCCGGTTCCG CCGGGCCCGG
GACTGGACCG TCGCGGCCTC GTACCAGTTC TTCCTGGCCG GGACGCTCCT GGCGGTCACC
GTCGGTGCCG TGCTGCGGTT CGCCACCCCG AGCCATCTGT GGCTCGACGA GTCGCTCACC
GTCGGCATCG CGTCCCGGCC GGTGCCGGAC CTGCTGCAGG CGCTGCGTCA TGACGGGTCG
CCCCCGCTCT ACTACCTGCT CTTGCACCTC TGGATCAGCG TGTTCGGTGA CGGGGACATC
GCCGTCCGGG CGCTGTCGGG CGTGCTCTCC CTCGCCACAC TCCCGCTGGC CTGGCTCGCC
GGGCGGTACG TGGGCCGGGC GGCGACCGCC CTCGGCAACG GCGGGCCGTC GGCCGAGCAG
CGGGTCGGGC TCGCCGCGCT CCTGCTGTTC GCCTGCTCGC CGTACGCGAT CCGCTACGGC
AGCGAGACGC GGATGTACTC GCTGGTCGTC CTGCTCGTGC TGGTGTTCGG CTTCGCCGCC
GTCCGGGCGC TGAACCGGCC GGACCTGCCA CGGCTCGCCG CGCTGACGCT GGCGACGGCC
GCCCTGGTCT ACACGCACTA CTGGACGTTC CTGGTGGTGT TCACCGTGGC GGCGTTCCTG
CTCCTGCAGG CGCGGCGGCG GGAGCACTAC CGCCGCCCGG CGCTGCGGGC GTTCGTCGCG
ATGGCCGCCT CGGCGGTGCT GTTCGCGCCC TGGATGCCGG TGTTCCTCTT CCAGATGCTG
CACACGGGGA CACCCTGGGC GCCCCGGGTG CAGGCCCAGG TGCTGCTCGA CACGGTCTTC
GACTGGGCCG GCCCGCAGTC GACCGGCGCG CTGCTGGGCA TCATCCTGCT CGGCGGGGCG
CTGATCGGGC TGACGGCCCG TCCGCTCGGC GGGGAGCTGC ACGTCAAGAT CTCCGGGCGG
GCGCCGGGCA GGTACCTCGC CGCGATCTGG CTCGCGCCGC TGGTGCTGGC GTACTTCGTC
AACATGTTCG GCGGCAGCGC CTACGCGGAG CGCTACACCG GGATCGCGCT GCCCGCCTGC
CTGCTGCTCG CGGCGCTCGG GATAGCGCAA CTGCCGGTGC ACCGGGCGTT CGTGGCGGTG
GTCGTGGTCG CCTCGATCAG CGGCCTGCTC GGTGGGTACC AGCTCGCCCG GACGGAGCGG
ACCCAGGCGG GCGAGATCGC CAACCGGATC GCCGACCTCG CCCGGCCGGG GGACGTCGTC
GCCTACTGCC CGGACCAGCT CGGCCCGGCC GTGCACCGGG CGATCCAGCG CCGGGGCGGC
ATCGACGTCC GGGAGATCGT CTACGCGGAC GAGGCCGGGC CGGCGCTGGT CGACTGGGTC
GACTACGCCG ACCGGATGAA ACGCGCGAAC GGTGCGGCCT TCGCGGCCGA GGTCAACGAC
CTGGCCGGGC CCGATCACGC CGTCCTGCTC GTCCGGGCGG ACGGGTACCG CTTTCTGGAG
GGCGCGTGCG CGGTGCTCTC CGACCAGCTG GCGTCCCTAC GGGACCGGGC GTTGCAGGTC
GAGAAGCGCG ATCTTTACGA GGGTGCCTCC CTCGAGCGGT TCTCCACCCT GCGCTGA
 
Protein sequence
MVSADDLPTG QVTGTTAEPA AAEPAAAAVA PRPRSAQVPP NDHAVLTGRP PRTELPAETQ 
PDAETQADAQ TMTAAETEPS AEAPPAAEAE PGEQTVPAQR TDYPSRAEPA RLIARFRRAR
DWTVAASYQF FLAGTLLAVT VGAVLRFATP SHLWLDESLT VGIASRPVPD LLQALRHDGS
PPLYYLLLHL WISVFGDGDI AVRALSGVLS LATLPLAWLA GRYVGRAATA LGNGGPSAEQ
RVGLAALLLF ACSPYAIRYG SETRMYSLVV LLVLVFGFAA VRALNRPDLP RLAALTLATA
ALVYTHYWTF LVVFTVAAFL LLQARRREHY RRPALRAFVA MAASAVLFAP WMPVFLFQML
HTGTPWAPRV QAQVLLDTVF DWAGPQSTGA LLGIILLGGA LIGLTARPLG GELHVKISGR
APGRYLAAIW LAPLVLAYFV NMFGGSAYAE RYTGIALPAC LLLAALGIAQ LPVHRAFVAV
VVVASISGLL GGYQLARTER TQAGEIANRI ADLARPGDVV AYCPDQLGPA VHRAIQRRGG
IDVREIVYAD EAGPALVDWV DYADRMKRAN GAAFAAEVND LAGPDHAVLL VRADGYRFLE
GACAVLSDQL ASLRDRALQV EKRDLYEGAS LERFSTLR