Gene Francci3_0437 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_0437 
Symbol 
ID3903626 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp519384 
End bp521033 
Gene Length1650 bp 
Protein Length549 aa 
Translation table11 
GC content66% 
IMG OID637877769 
ProductFAD dependent oxidoreductase 
Protein accessionYP_479553 
Protein GI86739153 
COG category[C] Energy production and conversion 
COG ID[COG0644] Dehydrogenases (flavoproteins) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGGGT ACGACTATTC GGTTGCCATC CTCGGTAGCG GCATCGCCGG CTCGACGCTG 
GCCAATATCC TCGCCCGACA CGGTCACCGC GTGGTTCTGA TTGACAGTGG CACGCATCCA
CGTTTCGCGC TCGGGGAGTC GACCATCGGT GAGACGACCT ATCTGCTGAA GTTGCTCGCC
CAGCGGTTCG ACGTCCCCGA GCTCGGCCAT GTGAGCAGCT TCGAGGGCGT CCGCTCCCAC
GTCACCTCGG CCTGCGGAGT GAAGCGCAAC TTCGGATTCG TCTACCACCA CGAGGGTGCG
CTCCAGAACC CCGAAGAGGT CACTCAGTGC AGCGTTTCGG AGTTTCCGAA CGGCCCGGAA
ATGCACATGC ACCGCCAGGA CATCGATGCT TACCTCTTCT ATACGGCCGT CCGTTACGGA
GCCGAACCGC GGCAACGCAC GATGGTGGAA AAAGTCGACT TCGCCGACGA CGCCGCGACG
CTGACGACAG GGGCCGGCGA ACGGATCCGC GTTCGCTACG TCGTCGACGC CTCAGGACGC
AACTCCGTGT TGGCGAACCA GTTCGCCCTG CGAGAGGATC CCTGCCGGTT CCGGACGAAC
TCCCGCACCC TGTTCACCCA CATGGTGGGG GTCACCCCCT TCGACGAGGT GACCCTCCCG
AGCGGGCAGC CGTCGCTGTG GCACCAAGGC ACCCTGCACC ATCTGTTCGA CGGCGGCTGG
CTGTGGGTGA TCCCGTTCGA CAATCACCAG CGGGCCACCA ACCCACTGTG CAGCGTCGGG
CTCAACCTCG ACTCGCGACG CTTCCCCAGG GATCCCTCGG TACCCGCCGA GCAGGAATGG
AACGCTTATC TTGAGCGGTT CCCCAGTATC GCGCGCCAGT TCGCCGGTGC CCGGCCCGCC
TGGGACTGGA TCTCCACCGG TCGCACCCAG TACTCCAGCT CGCGGACGGT CGGTGACCGC
TGGGCGATGA TGTCCCACGC AGCCGGGGCG ATCGACGCGC TGTTCTCCCG TGGCATGGCC
AACACCATGC AGGTCATCTA CGCCCTCGCA CCAACCCTGA TAGAGGCCCT CGCCGACGAC
GACTTCTCGG CGGAACGCTT CGGCCACATC GACACGCTCA ACCAGACGAT CCTGGACGTC
AATGACAAGC TCGTACACGG CTCCTATGTC TCGTTCCGCG ACTTCGACCT GTGGCGGGCG
TGGTCGAAGG TGTGGTTCCT GGGCTGGAAC ATGGGCATCT CCCGGATCGT CGGAACCTAC
TTCGGCTATC TGGAAAAGGG CGATCCGGCG CTGTTCGACC GCCTGCTCGA CGCGCCACAT
CTGGGCACTT TCTGCCCCGA TCTGCCGGAA TTCCAGCCCT TCTTCGACTC GCTCAGCGCC
GTGATGGACG AGGTCGAGGC CGGCCGGCTG GCCCCGGCCG CCGCCGTCGA ACGGCTCGCC
ACTCTGCTGG GCGGCGCCGA CTTCCTCCCC GCTCCGCTCC GGCTGGGCGA TGTGCTGCGC
CGCTGGCACG ACGGCTCCTT CGAAGCCCAG CGCCGCATGT ACGAGTGGGG GCGCACGAGT
TCCCCCGAGC CGCTCCGCCG CTGGTACGAG TACGACCTCG ACGACCTGCT CACCCGCACG
GGTGGGGTGC CCACTCCGGC AACCCTCTAA
 
Protein sequence
MSGYDYSVAI LGSGIAGSTL ANILARHGHR VVLIDSGTHP RFALGESTIG ETTYLLKLLA 
QRFDVPELGH VSSFEGVRSH VTSACGVKRN FGFVYHHEGA LQNPEEVTQC SVSEFPNGPE
MHMHRQDIDA YLFYTAVRYG AEPRQRTMVE KVDFADDAAT LTTGAGERIR VRYVVDASGR
NSVLANQFAL REDPCRFRTN SRTLFTHMVG VTPFDEVTLP SGQPSLWHQG TLHHLFDGGW
LWVIPFDNHQ RATNPLCSVG LNLDSRRFPR DPSVPAEQEW NAYLERFPSI ARQFAGARPA
WDWISTGRTQ YSSSRTVGDR WAMMSHAAGA IDALFSRGMA NTMQVIYALA PTLIEALADD
DFSAERFGHI DTLNQTILDV NDKLVHGSYV SFRDFDLWRA WSKVWFLGWN MGISRIVGTY
FGYLEKGDPA LFDRLLDAPH LGTFCPDLPE FQPFFDSLSA VMDEVEAGRL APAAAVERLA
TLLGGADFLP APLRLGDVLR RWHDGSFEAQ RRMYEWGRTS SPEPLRRWYE YDLDDLLTRT
GGVPTPATL