Gene Franean1_7034 details


Gene Information

Locus tag: Franean1_7034
Symbol:
ID: 5675345
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 8583805
End bp: 8585850
Gene Length: 2046 bp
Protein Length: 681 aa
Translation table: 11
GC content: 63%
IMG OID: 641245880
Product: levanase
Protein accession: YP_001511271
Protein GI: 158318763
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1621] Beta-fructosidases (levanase/invertase)
TIGRFAM ID:
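
The coordinate and length fields above are related by simple arithmetic, which the following minimal Python sketch checks. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon; the record itself does not state either convention. The function names `gene_length` and `protein_length` are illustrative only and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the gene length
    includes one terminal stop codon that is not counted as a residue."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(8583805, 8585850)
print(length, protein_length(length))  # 2046 681
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of this record.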


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00232409
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATTTAC GCATCAGACG AAGCGCGGGA AACAAGCATT CCGTGGTCAG GCGCCCCGTT 
ACTCTGACGG CGGTTCTGAC GGCGGCCACG GCGGCAGCCA GCATATTATT AGGCCTGCCG
GCGCCCACTC CGGCGTCGGC AGTCGGTCCG AGTTACCGGG ACACCTACCA CTTCACCGTG
CCCGACCACT GGAAGAACGA CCCGCAACGT CCGGTGTACG TGAACGGCAA GTTCTATTAC
TACTACCTCT ACAACGCCGA TTACGACGCC AACCCGACGG CCAACTACGG CACCGAGTGG
CGGCTCGCGA CCAGCTACGA CGGCGTCGTG TTCGCCGATC AGGGGGTTGC GGCACCGAAG
AAGACGAACG CCAACTATGA TCTGTGGTCG GGCTCAACGG TGGTCGACAC CGCCAATACA
GCCGGGTTCG GCGCAGGCGC CGTCGTGATG CTCGTGACCC AGATGGACCA TCCGACCGAG
GCGCAGAAGC TCAACGCATC CGGTCCGCAG GCCCAGTTCC TCTGGTATTC AACGAACGGC
GGCCGCAACT TCACACCGTA CGGCGAGGCG CCGGTCATCG CGAACGGCGG TCGGGCTGAC
TTCCGGGATC CGAAGGTCCT GTGGGATGCT GACCGCAACC GCTGGGTCGC GCTCATCGCC
GAAGGCCAGA AGATCGGTTT CTACACCTCG GCCAACCTCA AGGACTGGAC CCGGGTCTCC
GAGTACACCA ACAGCGGCCT CGGTATTCTG GAGTGCCCCG ACCTGTTCAA GATGCGTGCC
GATGACGGCA CCACCCACTG GGTAATGGGC ATGAGCGCAA ACGGTTACCT CACCGGCGCA
CCCAGCACGT ACGCCTACTG GACAGGAAGT TTCGACGGCA CGACATTCAC ACCCGACACG
GCGGAGCCGC AGTGGGCCGA CCACGGATTC GACTGGTACG GTGCGGTCAC CTGGGAGGAC
CCGACGGCAC CACTCGACAA GCGCTACGCG ATCGGATGGA TGAACAACTG GTCGTACCCG
CACTCGACGC CGACGTGGCC GAACGACGGC TTCAATGGAA CCGACTCGAT CACCCGCCAA
CTGGCACTGA AGAAATACGG ATCGACATAC AGCCTAACAT CCCAGCCAGT TGCTGCACTG
AACAATATTG CGACACAGAC AACGAACCTC GGTACGTTCT CGGTGAACGG CACCGTGCCG
CTCGCCTACA GTGGTACCTC GTACCAGCTC GAGACGACGG TGACCTGGAG CACTGCACAG
AACATCGGTC TTGGGCTGCG CAGGTCGAAC GACGGTACCC GCCACGCCGA CGTGGGCGTG
CACGACACGT ACAGTTATCT TAACCGAGGT GGTACCACAA ATCCGGACAC TTCTGGACAG
AAGCTTGAAA GTCGCGCTCC GTTCGACATG AGCGCCCAAA CCGTGCACCT GCGTATCCTC
GTCGACCGCA CCACGATCGA AGTGTTTGTC GACGACGGGC GCTTCGTGCA CTCGAGCCAG
GTCTTCCCCG ACCCCGCCGA TGCCGGCATC GCCCTGTACT CGCTCGGTGG GACAGCGACC
TTCTCCAACG TCACGATCAC CGAATTCGGT AGCGTCGTCC AGAGGCCAGC ACGACTGATC
GCGGACTTCG AGGGCTCGAC CTGGGGCAAC GGATGGACCG CGACGGGCTC CTTCGCCTCC
GCCGCGCCGA CCGTGGCATC GCTGCCCGGA CAGGTTGGCG CCAAGGTCGC CGACACCTAC
GTCGGCGGCG GTGACCCGGC GACCGGCACG ATCACCTCCC CGCCGTTCAC CATCGATCGC
AACCATCTCC ACTTCTCCAT AGCGGGCGGC AACCATCCCC TTGGCGCGGA ACCGGCCACG
TCCGTCCAAC TGCTCGTCGG CGGACAACCT GTCCTCACGT CAACCGGGGA TAATTCGAGT
ACCCTGCGAC ATGTCGAATG GGATGTTACT GCCTATGCCG GACAGGCCGC ACAATTCCAG
ATCCTTGACG ACGCGACTGG AACTTGGGGA CACCTCGTGG TCGACCAGGT CGTACTGAGC
GACTGA
 
Protein sequence
MNLRIRRSAG NKHSVVRRPV TLTAVLTAAT AAASILLGLP APTPASAVGP SYRDTYHFTV 
PDHWKNDPQR PVYVNGKFYY YYLYNADYDA NPTANYGTEW RLATSYDGVV FADQGVAAPK
KTNANYDLWS GSTVVDTANT AGFGAGAVVM LVTQMDHPTE AQKLNASGPQ AQFLWYSTNG
GRNFTPYGEA PVIANGGRAD FRDPKVLWDA DRNRWVALIA EGQKIGFYTS ANLKDWTRVS
EYTNSGLGIL ECPDLFKMRA DDGTTHWVMG MSANGYLTGA PSTYAYWTGS FDGTTFTPDT
AEPQWADHGF DWYGAVTWED PTAPLDKRYA IGWMNNWSYP HSTPTWPNDG FNGTDSITRQ
LALKKYGSTY SLTSQPVAAL NNIATQTTNL GTFSVNGTVP LAYSGTSYQL ETTVTWSTAQ
NIGLGLRRSN DGTRHADVGV HDTYSYLNRG GTTNPDTSGQ KLESRAPFDM SAQTVHLRIL
VDRTTIEVFV DDGRFVHSSQ VFPDPADAGI ALYSLGGTAT FSNVTITEFG SVVQRPARLI
ADFEGSTWGN GWTATGSFAS AAPTVASLPG QVGAKVADTY VGGGDPATGT ITSPPFTIDR
NHLHFSIAGG NHPLGAEPAT SVQLLVGGQP VLTSTGDNSS TLRHVEWDVT AYAGQAAQFQ
ILDDATGTWG HLVVDQVVLS D
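
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below is a hedged illustration and not the pipeline that produced this record: it builds the codon table from the standard amino-acid string in TCAG order, which translation table 11 shares with the standard code for its amino-acid assignments. The name `translate` is hypothetical. The sketch does not handle table 11's alternative start codons, which is not an issue here because this gene begins with ATG.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
# Assumption: translation table 11 uses the same assignments as the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used above) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
print(translate("ATGAATTTAC GCATCAGACG AAGCGCGGGA"))  # MNLRIRRSAG
```

The ten residues produced match the start of the protein sequence listed above; passing the full gene sequence to the same function would allow a complete comparison.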