Gene Francci3_4178 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_4178 
Symbol 
ID3907143 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp4981531 
End bp4983207 
Gene Length1677 bp 
Protein Length558 aa 
Translation table11 
GC content70% 
IMG OID637881506 
Productmonooxygenase, FAD-binding 
Protein accessionYP_483255 
Protein GI86742855 
COG category[C] Energy production and conversion 
COG ID[COG0644] Dehydrogenases (flavoproteins) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.876229 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTGTAC AGCGGGACCA TGCCGTCATC GTCGGCGGAG GTATTGCGGG ACTCACCGCG 
GCGCGCGTGC TGAGCGAACA CTTCGCCGCG GTCACCATTC TCGACCGAGA CGCCCTGCCG
ACAACCGTCG AAGAACGGCG CAACGTCCCG CAGGCGCCTC ATCTGCACGG CCTGCTGTTC
GGCGGCGCAT CGGCGCTCGA CGAACTCTAC GGCGGCTTCA CGGTCGCGAT GACCGACCAG
GGCGCGCCGC CGTATGACCC GCAACGCGAC GTCGCCTACT TCTTTCCCGA AGGCTGGATC
CGGCGTGCGC CATCCGACAT GCGGTCCGTG TTCGGCACGC GGGCCCTGAC CGAGCACACC
ATCCGGACGC TGACCCGCCG GCTACCGAAC GTCCGGTTCC GCCACGCCGA GACCATCGGC
CTGCGCAGGG CCGGCGGCGA CCGGGTAGAC GCGGTCGTGT GCGACGACCT CGACGGTCTG
GGCGGACTGA TCGAGGCCGA GCTTGTGGTG GACGCCGGCG GCCGCGCATC TCGCTCCCCG
CACTGGCTCG CCGACGCCGG GTTCGACGCC CCGCCGGAGT CGACGGTGCA ACCGTTCCTC
GGTTACGCCA CCGTGCACTG CCACTTGCCC GAGGATGCCC TACCGGGAGA CCTACGGGCA
GTGTGCGCGC CACCGGCACC GCACAACACC AGAGGTGCGT TCCTCCTGCC GGAGGAGAAC
AACCTCTATG GCCTCATGGC CGTCGGGACC AGCCGTGACT TCCCGCCCGG CGACCCGGCG
GGCTTCGACG AGTTTCTCCG GACCGCCGTC ACCCCCGTCC TGCACGAGAT GTGGCAACGT
GCCGAGCCCG TCACCGACAT CAGGACGACG AGGATGTCAG TGAATCGGCT CCGCCGCTGG
AACGAGCTGG CTCGGCGGCC ACAGGGGTTC ATCGCGGTCG GCGACGCGGT GGCCGTATAC
AACCCGGTGT ACGGCCAAGG CATGACCGCG GCCGTCCTGC AGGCGGTGGC CTTGCGCGAC
CGCCTGCGTC AGGCGGACGA TCTGGACACG GCCGTCGAGC GTCTCCACGA CGACGTCATG
GCCGTCACCT CGTTCGCCTG GCAGGCCGCG ACCGCATCCG ACCTGGTCTT TCCCGTGACG
GAAAGCCGTA ATATGCCCGC CCCGACACCC GAAGAGCGGG CGGGCGCACA ATACCTGGGC
ATGGTGCGCG CCACCGCCGT CGACGACGCT TACGTCGCGG GCGAATTCTA CCGGGCGCTG
GGCTTGATCC GGCCCGAGTT CCTCCTCGCT GACGAGGTCC GCACCCGAGT CGAACGGTGG
GTGGCGGACC CGCCGGCTCC GACCGACGAC CTGTCGCGCC CGCCTGCGTG GGCCGATGCC
GATCCACCGA CACGGAGTCG GCCGCCGTCG GGCACTGCCG CCGTCGGGCA CTGCCGCCGT
CGGGCACTGA GGAATGACGG GCGGCGGATC CCGGGTAGGA AAGCACCCCC AGCCACTCGA
AGGCGCCTTC GAGTACTCCT CTACCGCGGA CATCCCGAGG GTGGGGTGCT TTCCATCGTT
GGACGGCGCC CCACCCTCCG GTCTGCCTCG GCGGCTACGC CCCGGCGAAC GGCTGCTGCC
CGCCGAACCG CTGCCCGGCC TGCTCCGCCC CGGCCTGCTC CGCTTGGCCC GCGGTAA
 
Protein sequence
MPVQRDHAVI VGGGIAGLTA ARVLSEHFAA VTILDRDALP TTVEERRNVP QAPHLHGLLF 
GGASALDELY GGFTVAMTDQ GAPPYDPQRD VAYFFPEGWI RRAPSDMRSV FGTRALTEHT
IRTLTRRLPN VRFRHAETIG LRRAGGDRVD AVVCDDLDGL GGLIEAELVV DAGGRASRSP
HWLADAGFDA PPESTVQPFL GYATVHCHLP EDALPGDLRA VCAPPAPHNT RGAFLLPEEN
NLYGLMAVGT SRDFPPGDPA GFDEFLRTAV TPVLHEMWQR AEPVTDIRTT RMSVNRLRRW
NELARRPQGF IAVGDAVAVY NPVYGQGMTA AVLQAVALRD RLRQADDLDT AVERLHDDVM
AVTSFAWQAA TASDLVFPVT ESRNMPAPTP EERAGAQYLG MVRATAVDDA YVAGEFYRAL
GLIRPEFLLA DEVRTRVERW VADPPAPTDD LSRPPAWADA DPPTRSRPPS GTAAVGHCRR
RALRNDGRRI PGRKAPPATR RRLRVLLYRG HPEGGVLSIV GRRPTLRSAS AATPRRTAAA
RRTAARPAPP RPAPLGPR