Gene Franean1_3494 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3494 
Symbol 
ID5671865 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp4156666 
End bp4158048 
Gene Length1383 bp 
Protein Length460 aa 
Translation table11 
GC content69% 
IMG OID641242382 
Productnitrilotriacetate monooxygenase component A 
Protein accessionYP_001507802 
Protein GI158315294 
COG category[C] Energy production and conversion 
COG ID[COG2141] Coenzyme F420-dependent N5,N10-methylene tetrahydromethanopterin reductase and related flavin-dependent oxidoreductases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.748879 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCGCCAA GGCGCGAGGA CCAGCTCACC CTGGTCGCGT TCATGCAGGC TTCGAACGTG 
TCGGTGTACT CGGGGTCGTG GCGGTACCCG AGCTCGGCGC ACGACTTCCT CGACCTCCGC
TACTACCAGC GGATAGCGCG GGTGCTCGAG GAGGGCACGT TCGACCTGAT GTTCTTCGAC
GACCGCCTGG CGATGCCGTC GATCTACAAC GCGTCTCCCG CCGACGCCGT CCGCTACGGC
GCCCGCCCGG TCAAGCTCGA TCTGACCGCG GTGCTGGGCG CCGCCGCGGC CGCGACGTCG
CACCTGGGCC TCGGAGCGAC CTACTCCACG ACGTACTACC CGCCGTTCCA CGTCGCGCGG
ACCTTCGCGA CGCTGGACCA CCTCAGCGGC GGGCGGGCGG CCTGGAACGT CGTCACCTCG
GTGAACGACT CCGAGGCCCG CAACTTCGGG GTCGACCAAC ACCTCGGTCA CGACGAGCGC
TACGACCGCG CCGAGGAGTT CATCGACGTC GTCACCCGGC TCTGGGACTC CTGGGAGGAC
GACGCCCTGG TGATGGACCG GGAGTCCGGC GTCTTCGCCG ACCCGGGCAA GGTCCACGAA
CTCGACCACC ACGGCAAGTA CTTCCACGTG CAGGGACCCC TCACCGTCCC GCGCCCGCCG
CAGGGCCGGC TGCCCATCAT CCAGGCCGGC CAGTCGGGCC GCGGCCAGCA GTTCGCCGCC
AGGTGGGCCG ACCTGATCTT CACCGCCGAC CCGAGTCAGA GCGTCGCCGC CGAGCACTAC
CGCAGCCAGA AGGAACTCGT CACAGCCGAG GGCCGGTCCG CCGACGCCGT CCGGATGCTC
CCGATGGCGT ACGTCATCGT CGGCGAGACC GAGACGATCG CCAGGGAGAA GGAGAACATC
TTCCGCGACG AGCTCGTCCA CCCGATGGCG TCGCTGACGC TACTCTCCGA GCTCACCAAC
CACGACTTCT CGGGTTACTC ACTCGACGAC GAGATCACCG ACGAGCTCAT CAACTCCGTC
TCCGGCATCC GCGGCCTCGT CCAGGGCGTG AAGAAGCACC TCGGCGGCGG CAAGATGACA
CTGCGGACGC TGGCGAACCA CCGCGCCACC CTGCTGCAGG GCCCGCGCTT CGTCGGCACC
GGCACACAGA TCGCCGACCA GATGCAGGAC TGGTTCGAGA CCTACTCCTG CGACGGTTTC
GTCCTCGCGG CCACCCACTT CCCCGGCGCG TTCGAGGACT TCGTCCGGCT GGTGGTGCCC
GAGCTGCGCC GCCGCGGACT GTTCCGCTCC CGCTACACCG GCTCGACCCT GCGCGAGAAC
CTGGGCCTGG CACGGCCCGC CAGCAGCTTC ACCGCGTCGG TCAGCAGCGG TGTCCGCCCC
TGA
 
Protein sequence
MSPRREDQLT LVAFMQASNV SVYSGSWRYP SSAHDFLDLR YYQRIARVLE EGTFDLMFFD 
DRLAMPSIYN ASPADAVRYG ARPVKLDLTA VLGAAAAATS HLGLGATYST TYYPPFHVAR
TFATLDHLSG GRAAWNVVTS VNDSEARNFG VDQHLGHDER YDRAEEFIDV VTRLWDSWED
DALVMDRESG VFADPGKVHE LDHHGKYFHV QGPLTVPRPP QGRLPIIQAG QSGRGQQFAA
RWADLIFTAD PSQSVAAEHY RSQKELVTAE GRSADAVRML PMAYVIVGET ETIAREKENI
FRDELVHPMA SLTLLSELTN HDFSGYSLDD EITDELINSV SGIRGLVQGV KKHLGGGKMT
LRTLANHRAT LLQGPRFVGT GTQIADQMQD WFETYSCDGF VLAATHFPGA FEDFVRLVVP
ELRRRGLFRS RYTGSTLREN LGLARPASSF TASVSSGVRP