Gene Francci3_3506 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_3506 
Symbol 
ID3905240 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp4187054 
End bp4188391 
Gene Length1338 bp 
Protein Length445 aa 
Translation table11 
GC content69% 
IMG OID637880828 
Producthypothetical protein 
Protein accessionYP_482588 
Protein GI86742188 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3408] Glycogen debranching enzyme 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.170778 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCGTCC CCAGCCTTCC CGCAGACCTG GCGGCACTAC GGGAAACCGC GATCGACGTG 
CTACGAGACA ACGACCTCGG CGACATCACC CGGCCCTCCC CGACGCTCTA CCCGCACCAG
TGGCTCTGGG ACAGCTGCTT CATCGCGATC GGACTGCGCC GTCTCGACCC GGGACGGGCC
GCCAAGGAGG TCCTCTCGCT GCTGCGAGGC CAGTGGCCGA ACGGCATGAT CCCCCACGTG
ATCTTCGCGG AGACCTCGGA CTTCTACCAC GCCGGTCCGC AGCGCTGGCG CTGCGACCAG
GTGACCACGA CCGCCGGCGG AGTGCAGAGC ACGGGGGTCA CCCAGCCGCC GATGATCGCC
GAAGCCGCCG TCCGGATCGC AGCCATGATG ACGCCATCCG CCCGCGACGC CTTCTACCGC
GCGTTGTTCC CCGGCCTGCT GCGCTTCCAC ACGTGGCTCT ACCTCGAACG CGACCCCGAC
GACGACGGTC TCGTCACCCT GGTCCACTCC TGGGAGTCGG GGATGGACAA CACGCCGGCC
TGGATGGAGA TCACCAGGCC GGCGGCTCCG CTGGGAGTGC GGGCACTGCG CCGGATCAAC
GGCGGCGACG CCCTGGACGC GCTGCGGCGC GATTCCAAGG AGGTTCCCCC CGACGAGCGG
CTCACCTCGG GAGACCTGTT CACGCTCTAC CGGATCGTGC GTGAGCTGCG GCGGGCGCAC
TACAACTTCC GCGAGATCCG TCGGAGCATC GTCCCCCTCG TCCAGGACGT CGCCTTCAAC
GCGATCCTCA TCCGGGCCAA CGAGCACCTC ACCGCCATCG CCGCCGAGAT CGGCCAGACC
ATCCCCGCCT GGCTGTGGCG GTCCATGCAC AGGACGCGCG AGGCGATCGA GCGGCTCCAC
GCCGACGGCA CCTACTACAG CCGCGACTTC CGGACGGGAA CGCTGCTGCG CCACGAGACG
ATCTCGGGGT TCCTGCCGCT CTATGCCGGG GTCGTGCCCG AGGACCGGGT CGACGAGATG
GTCAAGACGC TCACCTCACC GCGCTACTGG TCCCGGTTCG GCATCGCGAG CGTGCCGCTC
GACGACCCGG GCTTCCTTCC CCGGTGTTAC TGGCAGGGTC CCGTCTGGGT CAACATGAAC
TGGCTGATTG CCGACGGGCT GGAGCGTTAC GGTCGGCTCG ACGCCGCGGA GAACCTGCGC
CAGAACACGA TCGACATGAT CGCATCGTCC GGGGCGATGT TCGAGTACTA CTCACCCCTC
GACGGTTCCG GCGCGGGCAG CAACCGGTTC TCCTGGACCG CGGCGCTCCT GGTCGACCTC
CTCGCCGCCG AGCACTGA
 
Protein sequence
MSVPSLPADL AALRETAIDV LRDNDLGDIT RPSPTLYPHQ WLWDSCFIAI GLRRLDPGRA 
AKEVLSLLRG QWPNGMIPHV IFAETSDFYH AGPQRWRCDQ VTTTAGGVQS TGVTQPPMIA
EAAVRIAAMM TPSARDAFYR ALFPGLLRFH TWLYLERDPD DDGLVTLVHS WESGMDNTPA
WMEITRPAAP LGVRALRRIN GGDALDALRR DSKEVPPDER LTSGDLFTLY RIVRELRRAH
YNFREIRRSI VPLVQDVAFN AILIRANEHL TAIAAEIGQT IPAWLWRSMH RTREAIERLH
ADGTYYSRDF RTGTLLRHET ISGFLPLYAG VVPEDRVDEM VKTLTSPRYW SRFGIASVPL
DDPGFLPRCY WQGPVWVNMN WLIADGLERY GRLDAAENLR QNTIDMIASS GAMFEYYSPL
DGSGAGSNRF SWTAALLVDL LAAEH