Gene Franean1_2552 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_2552 
Symbol 
ID5670946 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp3035815 
End bp3039129 
Gene Length3315 bp 
Protein Length1104 aa 
Translation table11 
GC content70% 
IMG OID641241468 
Producthypothetical protein 
Protein accessionYP_001506888 
Protein GI158314380 
COG category[S] Function unknown 
COG ID[COG4995] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGGAACAGT TCCGAGCGCG ACTCGACAGA CGTAGCCCCG TCTTAACCGC CCGTCTGCAG 
ATCACGGCGA TGCGCGCTGA AGAACTTATG CGGAACACGG TGCGTACCGG TGACCGTGCC
GGGCTGACCG CGGCGATCGA CCTGCATCGG CAGGCGCTGG ACATCTGTCC TTCCGACCAT
CCGGCGCAGG CCGCGATACT GGCCAATCTG GCCGCCGCAC TGTATGTCCG GTTCGACTGG
TCAGGCAGCA GGGCAGATCT GGATGAGGCC GTCGTCGCAG GCCGCGCGGC GCTGGCCGCC
CATCGACCCG GCGATCCCAA CCTGGCCGGA TGGCTGTCCA ACCTCAGCGC GGCGTTGCGT
ACCCGGTCCG AGCGGTGGGG CAGCAGGGCG GATCTGAACG AGGCCATCAG CGTGGGCCGC
CACGCGGTGG CTGCCAGCCC ACCCGACCAT CCGGAAACGA TCAAACGACT GGGCAACCTG
GCCGCCGCGC TGCGTATCCG GTCCGAGTGG ACGGGCAGCG AGGCGGATCT GGACGAGGCC
GTGAGCGCAG GCCGCGAAGC GCTGGCCGTC AGCTCACACC ACCCGGAAAG AACCACGATG
CTGGCTAACC ATGTGGCCGC GCTACGTCTC CGGTCCGAGC GGACGGGCAG CCAAGCGGAC
CTGGATGAGG CCATCAGAGC CGGCCGGGCG GCGCTGGCCG CCTGGGCATC TGGCGACCCT
GCGAAGGTCG CTCTGGTAAC GAACCTGGGC ACGGCGCTAC AGGCTCGGTT CGAACTGTTG
GGCAACCGGG CGGATCTGGA CGAGGCCATC GGCGCCGCCC GCGACGCGGT GACGGCGAGT
CCACCCGACC ATCTGGACCG TGCCACGGCG GTGGCGAATC TTGGTCTTGC ACTCCGTATC
CGATCGCAGC GGACGGGCAG CGAGGCAGAT CTGGATGAGG CCGTGAGCAC CGGCCGGGCG
GCGCTGGCAG CCAGCCCACC TGACCATCCC CACCGGGCCG GATGGCTGTC CAACCTCGCC
GCCGCGCTGC TTATCCGGTC CGAGTGGACG GGAAGCCAGG CGGATCTGGA CGACGCCGTC
AGCGCCGGCC GCGACGCAGT GGCCAGCAGC CCACCCGACC ACCCGGACAG AGGCACCCGG
CTGGCGAACC TCAGCCGCGC CCTGCTGACC CGGTTCGAAC TGTTGGACAG CCAGGCGGAT
CTGGACGAGG CGGTCACCGC CGGCCGCGAC GCAGTGGCCA GCAGCCCACC CGACCACCCG
GGCATGGCCA GCTACCTGTC CAACCTCAGC CGCGCCCTGC TGACCCGGTT CGAACTGTTG
GACAGCCAGG CGGATCTGGA CGAGGCGGTC ACCGCCGGCC GCGACGCGGT GGCCACCAGC
CCACCCACTC ATCCCTACCG CGCCGCGTGG CTTTCCAATC TGGGCATCGC CCTGCTGACC
CGGTTCGAAC GATCACCCAG CCAGGCCGAT CTAGACGAGG CGGTCATGGC GTGCCGCGGG
GCGCTGGACG CCAGCCCGCC CGATCATCCC TACCGCGCCG CATGGCTTTC TAACCTAGGC
ACCGCCCTAA GCCTCCGGTT CGAGCAGTAT GGCGATCAGG CGGATCTGGA CGAGGCGATT
GCCGCGAGTC GGGCCGCCGT GGCCGTCGAG GTGGCGTCAC CTCGGCTTCG CGCGCGGGCC
GCACGCGGTT GGGGACACAC CGCTGCCGAC GGGATGCGGT GGGACGAGGC CGTCGCGGGT
TTCGCGGCAG CCGTTGACCT CCTGGGACGG GTGGCACCGC GTAGCCTCGC CCGCGGCGAC
CAGGAACATC TACTCGCCGA GCTAGGCGGC CTGGGATCGC AAGCGGCGGC ATGCTGCGTG
CACGCCGGCC TCCCGGAACG CGCGATCGAA CTCTTCGAGC AGGGCCGTGG GGTCCTGCTG
GGTCAGGCAT TGGACACCCG CACCGACCTG ACCGCGCTCG CTGAGCAGTT TCCGGACCTA
GCCAGACGGT TCATCGCTCT GCGTGACGAC CTCGACAGGG TCGATGGCTC GGGCACTCGA
CCGGTGACGA TACCTCCCGG TTGGGACCAT AGCGTCGACA TAACACACGG CGACGTGGAG
CGACGCCGGC AGCTAGCCGA CGCATTCGAC CAGACGATCG GCGAGATCCG GGACCTGCCG
GCCTTCGCGC GTTTCCTGCG TCCACCACCG GTGCAGGACC TGACGGCGGC TGCGGCGAGC
GGTCCGGTCG TCGTCGTCAA CGTCTCCCGG TTCGGCTCGC ACGCGCTGAT CCTGACCACC
GGAGGTGTCC TGGAGCCGGT GCCGCTGGTG GCTCTAACTA CCGAGGCCGT TTACGACCAG
GCGGACGGAT TCCTCGCAGC CCTCGACGAG GTGTCCTCGC CGGACGGAGG CGCCGGAGGT
TGGGGTGCCG CGCAACGGCG GCTGACGGAC ACGCTCGGCT GGCTGTGGGA CGCCGCCACC
GGCCCGGTCC TGGACCGGTT AGGCATCACC GGACCGCCCC AGAAGGACGA GGAGTGGCCG
CGGCTGTGGT GGTGTGTGTC CGGACTGCTG TCGTTCCTGC CATTGCACGC GGCCGGCCAC
CACGCCACCC GCTTCGATCC CGCCCCGAAA ACGGTGGTTG ACCGAGTGGT GTCGTCCTAC
ACCCCGACCA TCCGGGCGCT GATCTACGCC CGCCGGACCC ACTCAGGCGA CAGGGACGTC
GACCGAGGAC GGCTCGGTTC GGACAGCCGG CTGGTGGTGG CGATGCCGCA CACCCCTGAG
GCCGCTGATC TTCCCGGTGC GGATGCGGAG GTCGCCATGC TCCAGCAGCG TTTCCCGGAC
CGGACCAGCA CGCTTGTCGG ACCTCAGGCC ACCCGCGAGG CGGTGCTTGC CGCGCTGCCC
ACGGCGGGGT GGGCGCATCT CGCCTGCCAC GGGTCGAGCG ACCCCAGTCA CCCTTCTGCC
AGTCGGCTGC TCCTCCAAGA CCACCGGCAG CAGCCGTTGA CCGTGGTCGA CGTGGCCCGG
CTGCGTCTGG ACGATGCTCA GCTGGCGTTC CTGTCGGCCT GCTCGACAGC CCGCCCAGGT
AACCGGCTGG CCGACGAAGC GATCCACCTC GCCTCGGCGT TCCAGCTAGC CGGCTACCGA
CACGTGATCG GCACCCTGTC GCCGATCAAT GATCGGCACG CCGCGACCCT CGCCCGCGAT
ATCTACACCG CCCTCGATGA CGCCGACGGC GTCATCGACG CAGCCGCCGC GCTGCATGCC
GCGACCCGCC GGCTGCGCAA CCGATGGGCA CACATGCCGT CGGTGTGGGC GTCGCACATC
CACAGCGGCG CCTGA
 
Protein sequence
MEQFRARLDR RSPVLTARLQ ITAMRAEELM RNTVRTGDRA GLTAAIDLHR QALDICPSDH 
PAQAAILANL AAALYVRFDW SGSRADLDEA VVAGRAALAA HRPGDPNLAG WLSNLSAALR
TRSERWGSRA DLNEAISVGR HAVAASPPDH PETIKRLGNL AAALRIRSEW TGSEADLDEA
VSAGREALAV SSHHPERTTM LANHVAALRL RSERTGSQAD LDEAIRAGRA ALAAWASGDP
AKVALVTNLG TALQARFELL GNRADLDEAI GAARDAVTAS PPDHLDRATA VANLGLALRI
RSQRTGSEAD LDEAVSTGRA ALAASPPDHP HRAGWLSNLA AALLIRSEWT GSQADLDDAV
SAGRDAVASS PPDHPDRGTR LANLSRALLT RFELLDSQAD LDEAVTAGRD AVASSPPDHP
GMASYLSNLS RALLTRFELL DSQADLDEAV TAGRDAVATS PPTHPYRAAW LSNLGIALLT
RFERSPSQAD LDEAVMACRG ALDASPPDHP YRAAWLSNLG TALSLRFEQY GDQADLDEAI
AASRAAVAVE VASPRLRARA ARGWGHTAAD GMRWDEAVAG FAAAVDLLGR VAPRSLARGD
QEHLLAELGG LGSQAAACCV HAGLPERAIE LFEQGRGVLL GQALDTRTDL TALAEQFPDL
ARRFIALRDD LDRVDGSGTR PVTIPPGWDH SVDITHGDVE RRRQLADAFD QTIGEIRDLP
AFARFLRPPP VQDLTAAAAS GPVVVVNVSR FGSHALILTT GGVLEPVPLV ALTTEAVYDQ
ADGFLAALDE VSSPDGGAGG WGAAQRRLTD TLGWLWDAAT GPVLDRLGIT GPPQKDEEWP
RLWWCVSGLL SFLPLHAAGH HATRFDPAPK TVVDRVVSSY TPTIRALIYA RRTHSGDRDV
DRGRLGSDSR LVVAMPHTPE AADLPGADAE VAMLQQRFPD RTSTLVGPQA TREAVLAALP
TAGWAHLACH GSSDPSHPSA SRLLLQDHRQ QPLTVVDVAR LRLDDAQLAF LSACSTARPG
NRLADEAIHL ASAFQLAGYR HVIGTLSPIN DRHAATLARD IYTALDDADG VIDAAAALHA
ATRRLRNRWA HMPSVWASHI HSGA