Gene Franean1_6999 details

Gene Information

Locus tag: Franean1_6999
Symbol:
ID: 5675310
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 8527514
End bp: 8530261
Gene Length: 2748 bp
Protein Length: 915 aa
Translation table: 11
GC content: 75%
IMG OID: 641245845
Product: fibronectin type III domain-containing protein
Protein accession: YP_001511236
Protein GI: 158318728
COG category:
COG ID:
TIGRFAM ID:
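
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes one terminal stop codon. Neither convention is stated in the record, and the helper names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 8527514
END_BP = 8530261
GENE_LENGTH_BP = 2748
PROTEIN_LENGTH_AA = 915


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Amino-acid codon count, assuming the length includes one stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for the values in this record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under those assumptions the reported gene length (2748 bp) and protein length (915 aa) agree with the coordinates.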


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGTTCG ACGTCGAGGC GTACCGGAGG GGCGTCGTTG ATCCCGCCCG GAAGCAGGGA 
GTGCCGAACG ACCTGTTCAC GCGCTACGCG CTGGACGAGG CCACCGCGGG TGACAAGGAG
CTGTTCGAGG CGCGGGTCGA GGAGGTCACG AAGTACTGGC GGACTCTGAA GCTCAAGAAG
ACCCACGCCG CCGTCGCGGA TGCCCTGCTC GCGGCGCACG CGGAGCTGAG GGGCTCCGGC
CAGCTCACGC CGGCGGCCTT CCGGGAACGC CGCGAGCGGG CGCGCAAGGA GGAACAGGAG
TACCTGGACC GCCTCGTCAC CGCGCTGGCC GCGGCGAACC CGTGTATCAG CGAGGCGATG
GTGGGCCGCC TGATCTCGGG GACCGCGGGG GCCGCCGGTG CGGGCAGCGG CGGGGGTGCA
CTCGACGCGG CGACCGTCCG GGCCGCGCTG AAGCGGCACA ACCTGCGGGT CATGGACCGG
GAGTGGAAGC TCCCGGTCGG CCCGCCGCCG AGTCAGGCGC GCTCGCTGCG CGAACCGCTG
GCCGTTCTCG GCCTGCGCCT GTCCGCCGAC GTCGTCTTCG GCGCCGAGCG GGTCCGGGCC
GGCTTCCGGC TGCGGGACGG CTTCCGGCTG GTGTCCGAGC CGGCCGTCGT CAACGAGCAG
GAGGTGACCC GGGCCCGCCA GGCCCGCGCC CGCGCGGCCC AGGACGAGCG TCAGACCGCG
GCCGAGAACA CCTTCGTGAT CATTCTCGGC TCCGTGCGCT CCCCGGCCGG GTACGACGCC
CTGATCGGCT GGGAGATCGT CGAGGCGCTG CGCGGCGGGG TCGAGGCAGG CCTGCCCGCT
CGCGCGATCG CCGACCAGGC GGCCTCCCTG GGCCTGGTCC TGGAGGAGGC CGAGGAGCTG
GCGGTCACGC TCGCCGAGGC CTGGCGGGGT GGCGCAGCCG GGTCCGGGAA CGCCGTCCAG
GAGATCCACG ACGCGCTCGC GGCCGGCCGG CTGCGCGCCG CTCGCCAGCT TGCCGCCGCG
TTGCCGGACG CCGAGCGGGC CGAGATGTCC GGGCAGGTCG AGGCGGCCGA AGCCAGGGTT
CGGACGTTGC TGACCGGCGC GGACGAGGCA TGGTCGCGCC ACCGGCCGGA GGAGACCGCC
GCGCTGCTGG CCGGGGCTGC CGAGATCGCG GCCGATCATG ACGACATCCA GGCCCGGCTC
CGCGCGATCC CGCCGCCGCC GCCACCGTCG GTCGAGGCGG GGGCCGACGG GACCAGGGTG
AGCGTCCGGT GGACACCCAG CCCCGCCCGC ACCGGCGAAG TCGCCTACCG GCTGGTGCGC
GCGACCGGCC GAGCCGCCAC CACGCCTGAT GCCGGGGAGC CAGTCGCCGA GACACCGGCC
AACACCGCGG TCGACGAGGC CGCTCCACTG GGCGAACGGG TCTTCTACAC GGTCTTCGCC
ACCCGCGCCG AGGGCATCTG GTCGTCGGGC GCGGATGCCG CCGCCGTTCT CCTCACCCCG
GAGGTCCAGG ACCTCGTGCT GGCGACCGAC GCGGAGTCGG TGACCGGCAC GTGGCGGGCG
CATCCGGGGA TCCACGAGGT CAACGTGCGC CGGACCGAAC GTGGCATGAC CGGTCGCGGG
ACAAGGCCGA CCTCCGTCAC GGCCACCGGC TTCACCGACA CACGCCTGCG GTCGGGCACG
GCCTACCAGT ACCGGATCGA GGCGGTGTAC CTCGGAGCGG ACGGGGCTCG GCGCGTCTCG
GCCGGGCTGG TCGTCACGGC CCGGCCGGAG CGGCCGCCTG AGGCGGTGAG CGACCTCGCC
GTCGAGCTTC CCACCGAGAC GGCATCGTCC CCGTCCGGGG CGTCGACCCG GTCGGACGGG
TTGTCGGCGC TCGTTGTCTG GACGCCGCCG ACAAGCGGGA CGGTGCAGGT GCGGATGGCC
TCCACGGCAC CAAGGTGGGC CGCGGGGACC GCTGTGTCAG CCGCGGATGT CTCCCGGCAC
GGCGTCGCGC TGCCCGGCGC ACCCGTGACC CGGGCCGACG GACGGGTCGC GCTGCCCGTC
CGCCCGCGGC ACGGCCGCGC CTTCCTCACC GCGGTCACCT GCGCCGCCGG CGGCTCCGTC
GCGGTCGTCG GCGCCACGGT GCCGATCTCG CTGGCGGACG CGGTCCGCGG GCTCACAGCG
GCTCGGTTCG GCGATGCCGT CCGGCTGCGC TGGGACTGGC CGGAGGGCGC GAGCCTGGCA
CGGGTCCAGT GGTACCCGGC CGGCTCCGCA GGTGACGGGC CGACCGGTGA GGTGGAGATC
CGACCGCGGC GCTACATCGA CAGCGGCGGG CTGGAGATCC AGGTCGGCTC GGATCCGGTC
ACCGTGGCGG TCCGGACGGT GACCGGCGAG GAGGAGGACC GTGCGGTCTC CCCGCCGGTC
ACCGTGCTCG TCCCCGGGCG CGGTGTCGAG GTGACCTACG AGCTCCGGCC GGCGCGGCTG
CCCTGGCCGC GGCGGGCCGT CCTCGAGGTG GTGTGTGACC GCACCTGCCG GCTGCCACCG
CTGGTCGTCG TCGGCAGGTC GGACGGGATC CTGCCGCTGA GCGCCACCCA TGGCACCGCG
CTCGCCCGTC TCCCGGCGCG CGACGTCACC GCTCGGAAGC GGGCGCTGAT CACAGTTCCG
GCGCCACGGC GCGGTGCCTG GGGGCTCGCC TGCTTCGTCG ACCCCACCGC GCCACCCGAG
GACGCGAATG CCCAGGTGAC GTTGGTCCGC GCCCGACGGA GGAGGTGA
 
Protein sequence
MAFDVEAYRR GVVDPARKQG VPNDLFTRYA LDEATAGDKE LFEARVEEVT KYWRTLKLKK 
THAAVADALL AAHAELRGSG QLTPAAFRER RERARKEEQE YLDRLVTALA AANPCISEAM
VGRLISGTAG AAGAGSGGGA LDAATVRAAL KRHNLRVMDR EWKLPVGPPP SQARSLREPL
AVLGLRLSAD VVFGAERVRA GFRLRDGFRL VSEPAVVNEQ EVTRARQARA RAAQDERQTA
AENTFVIILG SVRSPAGYDA LIGWEIVEAL RGGVEAGLPA RAIADQAASL GLVLEEAEEL
AVTLAEAWRG GAAGSGNAVQ EIHDALAAGR LRAARQLAAA LPDAERAEMS GQVEAAEARV
RTLLTGADEA WSRHRPEETA ALLAGAAEIA ADHDDIQARL RAIPPPPPPS VEAGADGTRV
SVRWTPSPAR TGEVAYRLVR ATGRAATTPD AGEPVAETPA NTAVDEAAPL GERVFYTVFA
TRAEGIWSSG ADAAAVLLTP EVQDLVLATD AESVTGTWRA HPGIHEVNVR RTERGMTGRG
TRPTSVTATG FTDTRLRSGT AYQYRIEAVY LGADGARRVS AGLVVTARPE RPPEAVSDLA
VELPTETASS PSGASTRSDG LSALVVWTPP TSGTVQVRMA STAPRWAAGT AVSAADVSRH
GVALPGAPVT RADGRVALPV RPRHGRAFLT AVTCAAGGSV AVVGATVPIS LADAVRGLTA
ARFGDAVRLR WDWPEGASLA RVQWYPAGSA GDGPTGEVEI RPRRYIDSGG LEIQVGSDPV
TVAVRTVTGE EEDRAVSPPV TVLVPGRGVE VTYELRPARL PWPRRAVLEV VCDRTCRLPP
LVVVGRSDGI LPLSATHGTA LARLPARDVT ARKRALITVP APRRGAWGLA CFVDPTAPPE
DANAQVTLVR ARRRR
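
The gene and protein sequences can be related by codon translation. The following is a minimal sketch, not the database's own method. It uses the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (the two differ in permitted start codons). The function name `translate` and the choice to stop at the first stop codon are assumptions made for illustration. Only the first 30 bases and the final line of the gene sequence are used.

```python
from itertools import product

# Standard codon assignments in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 0, ignoring whitespace, stopping at a stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    # Walk complete codons only; any trailing partial codon is ignored.
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the gene sequence.
    print(translate("ATGGCGTTCG ACGTCGAGGC GTACCGGAGG"))
    # Final 48 bases of the gene sequence, which start in frame.
    print(translate("GACGCGAATG CCCAGGTGAC GTTGGTCCGC GCCCGACGGA GGAGGTGA"))
```

The two fragments translate to MAFDVEAYRR and DANAQVTLVRARRRR, matching the start and end of the protein sequence listed above.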