Gene Francci3_1349 details

Gene Information

Locus tag: Francci3_1349
Symbol:
ID: 3906562
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 1620058
End bp: 1622553
Gene Length: 2496 bp
Protein Length: 831 aa
Translation table: 11
GC content: 72%
IMG OID: 637878682
Product: malto-oligosyltrehalose synthase
Protein accession: YP_480455
Protein GI: 86740055
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3280] Maltooligosyl trehalose synthase
TIGRFAM ID: [TIGR02401] malto-oligosyltrehalose synthase
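
The Start bp, End bp, Gene Length and Protein Length fields can be cross-checked against each other. The following is a minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive and that the gene length includes the stop codon. The helper names `gene_length` and `protein_length` are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(1620058, 1622553)
print(length_bp)                  # 2496
print(protein_length(length_bp))  # 831
```

Under those assumptions, both values agree with the Gene Length and Protein Length listed in the record.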


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTCAGCTG ACCCGACGCA CCCCGACGGG GGGTACGGCG CCGACCGGGT GGTACCGACC 
GCCACCTACC GGCTCCAGCT CAACCTGGAC TTCTCCCTCA CCGATGCGGC GGTCGTCGTC
CCCTACCTGG CCACGCTCGG CGTCTCCCAC CTGTACCTCT CGCCCGTGCT GGAGGCCGCG
CCCGGCTCGA CCCACGGCTA CGACGTCGTC GAGCACGGGC AGATCAATCC CGAGCTGGGG
GGAGCCGGCG GGCTGCGCCG GCTCGTCGCC GCGTGCCGCA AGGCGGGACT CGGCCTGATC
GTCGACGTCG TCCCCAACCA CATGGCGATC GCCCCGGAAA CGACGAACGC GGCCTGGTGG
TCGGTGCTGC GCGAGGGACC CGAATCCCCC TACGCCAGCT GGTTCGACAT CGACTGGGCC
TCCCCCGACA ACCCCGGCCG GGTGTTGGTG CCGATCCTGG GCGCCGGGCT CGCCGACTGC
CTGGCGGCCG GGGAGATCTC CGTCGAACAG GACGAGGACG GCGACTGGGT CGTCGTCTAC
TACGACCACG TGCTGCCCGT CGCCGCGGGC ACCGCCAACC CCGACGACGT CGCCGCGACC
CTCGACGCGC AGTTCTACCG GCTGTGCTGG TGGCGGGTCG CCGCGACCGA GCTGAACTAC
CGGCGGTTCT TCGATATCAC GACCCTGGCG GCGCTGCGCC AGGAGGACCC GGACGTGTTC
GCCGCCACGC ACCGGATTCT CATCGAACAG GTCCGCGCGG GGACCTTCGA CGGGCTGCGC
ATCGACCATC CGGATGGGCT CGCGGACCCG GAAGATTACC TGCGCCGGCT GTCGGAGGCC
ACCGGCGGGG TGTGGACCGT CGTCGAGAAG ATCCTGGAGG ATGACGAGGC GCTGCCGGAC
ACCTGGGCCT GCGACGGCAC CACCGGCTAC GAGGGGATCG GCCGGCTGAC CCGGCTGTTT
CTCGACCCGA TGGCGGGTCA TCCCCTCGCG GTGCTCTACG GCGAGATCAC CCGTGCGGAC
CCCGACTACG AGGCCGAGGC GCGGGCCGCC AAGCTCGACG TCCTCGCGGG CGTGCTCCAG
CCCGAGGTGG ACCGGCTGAC AAGCCTCGCG CTCGGGGAGG CCAGGGAGGC CCGGGCCGAC
CTCACCCGAA CCGGTCTGCA CGAAGCGATG TGCGAGATCC TCGCCGCCTT CGACGTCTAC
CGGGCCTACA TCCGCCCAGA CGGCACGCCG AGCCTGGAGG CCCGCGGGCA CGTGGTCCGC
GCCTGCGAGC AGGCCCGGTC CCACCTGCCG GGGCGGGCGA CCGAGGTCGA TCTCATCGAG
GACCTGGCGC TGGGCGGGCC GGCCGAGTTC GTCGTCCGGT TCCAGCAGAC CTGTGGGCCG
GTGATGGCCA AGGGCATCGA GGACACCGCC TTCTACCGGT ATGCCCGGCT GCTGGCACTG
AACGAGGTCG GCGGCAACCC CGGCCGCTTC CCGGCGTTCA CGACCGGGCA CCCACGCAGC
GCCGTCACCG AGTTCCACGA GGCGAACCTG TCGGTCCAGC GAAACTGGCC ACTGACGATG
ACCACCCTGT CCACGCATGA CACCAAGCGG TCGGAGGATG TGCGGGCCCG GCTCGCGGTG
CTGTCTGAGG ACCCCCGCGG CTGGGCCGAG GTGGCCGGCC GGCTCGCGCG GCTCGGGGAA
CGCCACCGCG ACACCGAGCA GGGCTGGCCC GACCGGGTCA CCGTCTACTT CCTGCTGCAG
ACCCTGGTCG GCGCCTGGCC GCTGCCCGCC GACCGGGCCA CGCAGTACAT GCTCAAGGCC
GTCCGGGAAG CGAAGACCCA CACGAGCTGG ACCGACCAGG ACCCCGGCTA CGAGGCGGCC
CTGACGAACT ACATCGAGTC AGTGCTGGAG GACGACGAGT TCGTCGGCAT CCTGGAGCGG
TACGTCGCCA CGCTGGTGGA GCTGGGCCGG CAGAACTCGC TGGCCCAGAA GCTGCTCCAG
CTGACCATGC CGGGGGTGGC GGACGTGTAC CAGGGCCAGG AGCTGTGGGA TCTCTCCCTC
GTCGATCCGG ACAACCGGCG GGCGGTCAAC TACGGTGACC GCACCAAGCT GCTCGCCGAG
ATCGGTGTGG AGTCGACCGC CGAGACCGGT GCGCCGCGGC GCCCGCCGAC CCTCGACGAC
TCCGGCGCGG CGAAGCTGCT CGTCGTCGCC CGGGCGCTAC GGACCCGCCG CGACCATCCC
GAATGGTTCG GGGCGGACGC GATCTACCGG CCGCTGTGGG CCTCCGGCTC GGCCGCGGAG
AACGTCGTCG CCTTCAGCCG GTCCGAGTCC GTCGTCACCG TGGTGCCCCG GCTTGTCCTC
GGCCTGCGCC GCGGCGGCGG TTGGCGGGAC ACCACCATCC CCCTGCCCGA GGGACGCTGG
ACGGACGTGC TCACCGGCCG CAAGCACGAC GGCGGCACCG CCTATGTGCT CCGGCTGCTA
CGCGACTTCC CGGTGAGCCT GCTCGTCCGC GCGTAG
 
Protein sequence
MSADPTHPDG GYGADRVVPT ATYRLQLNLD FSLTDAAVVV PYLATLGVSH LYLSPVLEAA 
PGSTHGYDVV EHGQINPELG GAGGLRRLVA ACRKAGLGLI VDVVPNHMAI APETTNAAWW
SVLREGPESP YASWFDIDWA SPDNPGRVLV PILGAGLADC LAAGEISVEQ DEDGDWVVVY
YDHVLPVAAG TANPDDVAAT LDAQFYRLCW WRVAATELNY RRFFDITTLA ALRQEDPDVF
AATHRILIEQ VRAGTFDGLR IDHPDGLADP EDYLRRLSEA TGGVWTVVEK ILEDDEALPD
TWACDGTTGY EGIGRLTRLF LDPMAGHPLA VLYGEITRAD PDYEAEARAA KLDVLAGVLQ
PEVDRLTSLA LGEAREARAD LTRTGLHEAM CEILAAFDVY RAYIRPDGTP SLEARGHVVR
ACEQARSHLP GRATEVDLIE DLALGGPAEF VVRFQQTCGP VMAKGIEDTA FYRYARLLAL
NEVGGNPGRF PAFTTGHPRS AVTEFHEANL SVQRNWPLTM TTLSTHDTKR SEDVRARLAV
LSEDPRGWAE VAGRLARLGE RHRDTEQGWP DRVTVYFLLQ TLVGAWPLPA DRATQYMLKA
VREAKTHTSW TDQDPGYEAA LTNYIESVLE DDEFVGILER YVATLVELGR QNSLAQKLLQ
LTMPGVADVY QGQELWDLSL VDPDNRRAVN YGDRTKLLAE IGVESTAETG APRRPPTLDD
SGAAKLLVVA RALRTRRDHP EWFGADAIYR PLWASGSAAE NVVAFSRSES VVTVVPRLVL
GLRRGGGWRD TTIPLPEGRW TDVLTGRKHD GGTAYVLRLL RDFPVSLLVR A
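
The sequences above are printed in space-separated blocks of ten, so they need to be joined before any analysis. The sketch below shows one way to strip that layout, compute a GC fraction, and run a basic sanity check on a coding sequence. All function names are hypothetical. The start-codon set is an assumed subset of those commonly used in bacteria, not the full list permitted by translation table 11.

```python
def clean_sequence(blocks: str) -> str:
    """Join a grouped listing by removing spaces and line breaks."""
    return "".join(blocks.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(1 for base in seq if base in "GC") / len(seq)


# Assumption: a common subset of bacterial start codons; the stop codons
# are the three standard ones.
COMMON_STARTS = {"ATG", "GTG", "TTG"}
STOPS = {"TAA", "TAG", "TGA"}


def looks_like_cds(seq: str) -> bool:
    """True if seq is a whole number of codons with a start and a stop codon."""
    return len(seq) % 3 == 0 and seq[:3] in COMMON_STARTS and seq[-3:] in STOPS


# First row of the gene sequence as listed in this record.
first_row = clean_sequence(
    "GTGTCAGCTG ACCCGACGCA CCCCGACGGG GGGTACGGCG CCGACCGGGT GGTACCGACC"
)
print(len(first_row))  # 60
print(f"{gc_fraction(first_row):.2f}")
```

Passing the full gene sequence through `clean_sequence` and then `gc_fraction` gives a value that can be compared with the GC content field. `looks_like_cds` confirms the sequence starts with a recognised start codon (here GTG) and ends with a stop codon (here TAG).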