Gene Francci3_3678 details

Gene Information

Locus tag: Francci3_3678
Symbol:
ID: 3905362
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 4411051
End bp: 4413069
Gene Length: 2019 bp
Protein Length: 672 aa
Translation table: 11
GC content: 68%
IMG OID: 637881004
Product: alpha amylase, catalytic region
Protein accession: YP_482759
Protein GI: 86742359
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0366] Glycosidases
TIGRFAM ID:
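
The start, end, gene length, and protein length values in this record agree with one another, and a short sketch can show the arithmetic. It assumes 1-based inclusive coordinates and a coding sequence that ends in a single stop codon not counted in the protein length. The function names are hypothetical and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record for Francci3_3678.
length_bp = gene_length(4411051, 4413069)   # 2019
length_aa = protein_length(length_bp)       # 672
```

Both results match the "Gene Length" and "Protein Length" fields of the record.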


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.763851
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATGATCG GACGTTCAGT AGGCCGGGTT GTCGTCTCCG ATGTGACACC GACCGTCTCG 
TGTGGCCAGT GGCCGGCGCG GGCGGTCGCG GGCGAAATTC TCACCGTCGG TGCAACGGTG
TTCCGCGAGG GTCATGACCT CATCGGCGCG AACGTCGTGC TGTCAGGTCC CGACGGGCAG
GGAACTCCGT TCATCCGGAT GCGCTCCGCC GGCCCCGGTA CCGACCGTTA TGAGGCGGAG
ATCACCGCCG GGACCGAAGG CCTGTGGGGA TATCGGGTCG AGGCGTGGGC CGATCCGGTC
GCCACCTGGC GGCACGGTAT TGCCCTCAAG GTCGGCGCGG GCCAGAGCAC GGACGAACTC
GCGGTGGACT TCGAGGACGG TGCGCGCCTG CTGCTGCGGG CCCTGCCCGC AGTCCCCGAA
CCGCGCCGGG CGGAGATCGC TCTCGCCGTG GCCGCGCTGC GCGACGACGA CTGCACCGAC
CCCCGCGACC GGATCGCCGC GGCCCTCGAT CCCCAGCTGG TGAGCCTGCT GGACGCCTGT
CCGCTGCGCG AACTGGTGAC CCGCTCACCG CTGTACCGGC TGTGGGTGGA TCGTCGGCGC
GCCCTCTACG GCAGCTGGTA CGAGATGTTC CCGCGCTCGG AGGGCGCGAG CCTCGACCCG
CCCCGGTCCG GGACCTTCCT CACCGCGGCC GAACGCCTGC CCGCGGTCGC GGCGATGGGC
TTCGACGTGG TGTACCTGCC GCCGATCCAT CCGATCGGCG AGGTCAACCG CAAGGGTCCC
AACAACACCC TCACCCCCGG TCCGACGGAC CCCGGCTCGC CGTGGGCCAT CGGCAGCGAA
CACGGCGGCC ACGACGCCGT GCATCCCGAC CTCGGCACGA TCGACGACTT CGACCTGTTC
GTCGCCCGGG CACGCTCGCT GGGCATGGAG ATCGCGCTGG ACCTCGCCCT GCAGTGCGCG
CCGGACCATC CATGGGCGAA GCATCACCCG GAGTGGTTCG TCGTGCGTAG TGACGGCTCC
ATCGCCTACG CGGAGAATCC GCCGAAGAAG TACCAGGACA TCTATCCGCT GAACTTCGAC
GCCGACCCGA CCGGGCTCTA TCAGGAGATC CTGCGCGTCG TCCGGTACTG GACTGCACAC
GGAGTACGAA TCTTCCGTGT CGATAATCCG CATACAAAGC CCGTCGAGTT CTGGGAATGG
CTCATCGCCC AGGTGAAGTC GACCGAACCA GATGTGCTCT TCCTCGCGGA GGCATTCACC
CGGCCGGCGA TGATGCACAC GCTCGCCAAG GTCGGTTTCA CCCAGTCATA TACCTATTTC
ACCTGGCGCA ACACGAAGTG GGAGCTCGAG AAGTACGCGC GCGAACTGGT GTCGGCCGCG
CACTACATGC GGCCGAACTT CTTCGTCAAC ACCCCGGACA TCCTGCCGGA GTACCTGCAG
CACGGCGGCC CGGCGGCGTT CCGGATCCGG GCGGTGCTCG CTGCGACGCT GTCACCGACC
TGGGGCGTCT ACTCCGGGTA CGAGTTGCGC GAGAACACCC CGGTCCGACC GGGCAGCGAG
GAGTACCTGG ACTCCGAGAA GTACCAGTAC CGGCCACGCG ACTGGGCCGC GGCGGAGCGT
GCGGGCCAGT CGCTCGCGCC GTACCTGACC AGACTCAACC AGATCCGCCG TGCCCACCCC
GCCCTGCAGT GGTTGCGCAA CCTGCACTTC CACCATGCCG ACGGGGACGA GATCATGGTC
TTCTCCAAGC GGGTGGACTC CCTGCGGGCG GACGGCACGG ATCCCGGGGA CACGGCCGCC
GCCGACACCG TGCTCATCGT CGTCAACCTC GACCCGCACG CTCCCCGGGA GACCACCGTG
CGGCTCGACA TGCCGGCCCT CGGCCTCGGC TGGGAAGACT CCTTCGAGGT CACCGATGAG
ATCACTGGTG CCACCTACGC GTGGGGCAAG CAGAACTACG TGCGGCTGGA CCCGGCGGTC
GAGCCCGCGC ACGTCTTCGC TGTGCGGGCC CGGTCGTGA
 
Protein sequence
MMIGRSVGRV VVSDVTPTVS CGQWPARAVA GEILTVGATV FREGHDLIGA NVVLSGPDGQ 
GTPFIRMRSA GPGTDRYEAE ITAGTEGLWG YRVEAWADPV ATWRHGIALK VGAGQSTDEL
AVDFEDGARL LLRALPAVPE PRRAEIALAV AALRDDDCTD PRDRIAAALD PQLVSLLDAC
PLRELVTRSP LYRLWVDRRR ALYGSWYEMF PRSEGASLDP PRSGTFLTAA ERLPAVAAMG
FDVVYLPPIH PIGEVNRKGP NNTLTPGPTD PGSPWAIGSE HGGHDAVHPD LGTIDDFDLF
VARARSLGME IALDLALQCA PDHPWAKHHP EWFVVRSDGS IAYAENPPKK YQDIYPLNFD
ADPTGLYQEI LRVVRYWTAH GVRIFRVDNP HTKPVEFWEW LIAQVKSTEP DVLFLAEAFT
RPAMMHTLAK VGFTQSYTYF TWRNTKWELE KYARELVSAA HYMRPNFFVN TPDILPEYLQ
HGGPAAFRIR AVLAATLSPT WGVYSGYELR ENTPVRPGSE EYLDSEKYQY RPRDWAAAER
AGQSLAPYLT RLNQIRRAHP ALQWLRNLHF HHADGDEIMV FSKRVDSLRA DGTDPGDTAA
ADTVLIVVNL DPHAPRETTV RLDMPALGLG WEDSFEVTDE ITGATYAWGK QNYVRLDPAV
EPAHVFAVRA RS