Gene Francci3_4015 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_4015 
Symbol 
ID3906976 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp4802257 
End bp4803687 
Gene Length1431 bp 
Protein Length476 aa 
Translation table11 
GC content70% 
IMG OID637881344 
ProductUBA/THIF-type NAD/FAD binding fold 
Protein accessionYP_483094 
Protein GI86742694 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0476] Dinucleotide-utilizing enzymes involved in molybdopterin and thiamine biosynthesis family 2 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTCACC TCGCAGTGAA CCCACCGGCC ACCGGCTGGA GCCTGACCAT CCCGGTCAAG 
CTCTGGACGA CGCTGGCCGA CCACCTGTTC TCCGACGGCG ACGAGCACGG AGCCGTCATC
CTCGCCGGCT ACGCGGACGG CCCCCGCGGC CCCCGCCTTT TGGCCCGCGA CGTCATACTC
GCCGCCGACG GCACCGACTT CGTCGACGGC ACGACCAGCT ACCGGGCGCT CGACGCGACG
TTCGTCCGCG ACCAAGCCCT CCGCGCCCGC GACGAGAAGC TCGTCTACCT CGCCGTCCAC
AACCACGCCG ACATCCTCCA GCCCGGAGCC GTCGCGTTCT CGACGATCGA CATGGCCAGC
CACGAACGCG GATACCCAGC CCTGCGGCAG ATCACCCGCC AGATCGTCGG TGGGCTCGTC
CTCACCCCCC GGGCCGCCGC CGGCGACCTT TGGCTCCCCG ACGGCACCCG CGCTGTCCTC
GCCGAGACCG TCGTTCCGGG AAACAACATC ATCTGGCTCC GTCCACGGCC GGCACCTGCA
CCTGACATCG ACCCGCGGTG GGATCGACAG GCGCTGCTGT TCGGCCCAGC CGGGCAGCAG
ACCTTCGCCA GAATGCGCGT CGCGGTCGTC GGCCTCGGCG GAGCAGGGAG CATCATCACC
GAACTCCTCG CACGCCTCGG CGTCGGCGAA CTCGTCCTGA TCGACGGCGA CCGGGTCGAA
GCCACCAACC TGCCGAGGCT AGTCGCCGCC GAACCAGACG ACGTCGGCGA ACTCAAGGTC
AATATCGCCG CGCGAAACGC GCGCCGAGCC AACCCCTCCA TCCAGATCAC GGCCATCGCC
GACCGCGTCG AGCATCCTGA CGCCCGCGAC GCACTGACCA CCTGCGACTG GATCTTTCTC
GCCGCAGACG CTCACTCCGC CCGACACTGG GTCAACCTCA CCGTCCACCA GTACCTGATC
CCCGCCACCC AGGTCGGCGT CAAGATCCCA GTAGGTCCAG CCGGCGAGAT CGGTGAGATC
CACACCGTGG CCCGTCTACT GCTGCCCGCC GAAGGCTGCC TGTGGTGCAA CGGGCTGATC
GACTCGACCC AGCTCGCGAT CGAGATGCAC TCCGCAGCCG ACCGACGCAA CGCGCAGTAC
GTGCCGGAGG TCCCGGCAGC GAGCGTCATT GCGCTGAATG CGCTGCCCAC CGCCGAGGCT
GTCAACCACT TCATGTTCGC CGCCGTCTGC CTCCACGACG ACCCCACCGA CAGCGCCTCG
GTCCTGCATC ACCCGCGTGC GCGCGGCCGA GCCCTCCAGG ACGGCCGGCA AGATCCCGAC
TGTCCTTGGT GCACCAAGGC CGGCAGCCTC GCGCGCGGCG CAGGTGACGT CACGGAGGGG
GCCGCCCGGC TCGTCGGCGT GCCCCGCGCG CAGGCCCAAA GTGGCTCGTG A
 
Protein sequence
MAHLAVNPPA TGWSLTIPVK LWTTLADHLF SDGDEHGAVI LAGYADGPRG PRLLARDVIL 
AADGTDFVDG TTSYRALDAT FVRDQALRAR DEKLVYLAVH NHADILQPGA VAFSTIDMAS
HERGYPALRQ ITRQIVGGLV LTPRAAAGDL WLPDGTRAVL AETVVPGNNI IWLRPRPAPA
PDIDPRWDRQ ALLFGPAGQQ TFARMRVAVV GLGGAGSIIT ELLARLGVGE LVLIDGDRVE
ATNLPRLVAA EPDDVGELKV NIAARNARRA NPSIQITAIA DRVEHPDARD ALTTCDWIFL
AADAHSARHW VNLTVHQYLI PATQVGVKIP VGPAGEIGEI HTVARLLLPA EGCLWCNGLI
DSTQLAIEMH SAADRRNAQY VPEVPAASVI ALNALPTAEA VNHFMFAAVC LHDDPTDSAS
VLHHPRARGR ALQDGRQDPD CPWCTKAGSL ARGAGDVTEG AARLVGVPRA QAQSGS