Gene Franean1_0658 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_0658 
Symbol 
ID5669075 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp770370 
End bp772220 
Gene Length1851 bp 
Protein Length616 aa 
Translation table11 
GC content76% 
IMG OID641239585 
ProductIucA/IucC family protein 
Protein accessionYP_001505023 
Protein GI158312515 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG4264] Siderophore synthetase component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.577175 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.913499 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGACCACCG GGCTTCCTAG AGCCGCCGCG GGGCGGTGGC GCGCTGCCGG GCTGGCGCTG 
CTCACCCGGC TGATCGCGGA GCTCGCCTAC GAGGAGCTGC TCGTTCCGCG GGCCGAGCCG
GCGACCCCGC CGCGCACCCC GCCGCCGGTC GGGCGCGGCG CGCCGGCGCC GTACCGGATC
GAGTGCGGCC CCGTCACGTA CACGTTCACC GCCCGGCGCG GCACGTTCGG CACCTGGTGG
CCCGACCCGG CGACGCTGCG CCGCGACGGC GAGCCGGCCT GGGACCCGGC CCGGTTCCTG
CTCGACACCC GGCAGGGCCT GGGCTGGTCC GGGGACGTCC TGACCGATGT CGTGCGCGAG
GTGACGGCCA CCCAGCGCGC CGACGCCGAG ATCCTGCGGA CGGCCCTGCC GGCCCGCGCG
CTCGCCGACC TCTCCCACCT TGAGCTGGAG GGGCACCAGA CCGGCCATCC CTGCATGATC
GCGAACAAGG GGCGGCTCGG GTTCGACGCG GCCGACGCGG CGCGGTACGC CCCGGAGTCG
CGCCGCCCGT TCCGGCTGCG CTGGGTGGCC GCCCACGCCG AGCTGGCGCG GCTCGTCACC
GGCCCCGGGC TGGGCGCCAC CGGGCTGCGC GAGGCCGAGC TCTCCGCCGC CACCAGAGCC
GAGTTCGGCG CGGTCCTGCG CGCCGCGCTC GAGCCCACCG CCGGAGCGGA CGCCGCCGCG
TTGGCCGAGC GGTACGTGTG GCTGCCGGTG CACCCGTGGC AGTGGGAGAA CGTGGTCGCG
CCGATGTTCG CCGGCCCGCT CGCCACCGGG CAGCTGGTCG ACCTCGGCGA GGCGCCCGAC
CGCTACCTGC CGTTGCAGTC CGTGCGCACG GTGGCGAACA TCGACACCCC GGGCCGGCGC
GACGTCAAGC TCGCGCTGAT GATCCGGAAC ACGCTGGTGT GGCGCGGGAT GTCCGCCGCG
GACGCCACCG CCGGACCGGC CGTCTCGGCG TGGCTGGTGT CGCTCGCCCA CGGTGATCCG
GTGCTGCGGG CCACCGGCGT CGTCGTGCTG CCCGAGATCG CCGGAGCCAC CGTCGCGCAT
CCCGCCTTCG ACGCGGTGCC GGACGCCCCC TACCGGCTGC ACGAGCTGCT CGGGGTGCTC
TGGCGGGAGC CGGTCGCGTC CTTCCTGGCC CCCGACGAGC GGGCCCGGAC GATGGCCTGC
CTGCTCACCG TCGGCGCGGA CGGCGAGTCG CTGGCCGCCG AGCTCGTCCG CCGCTCCGGG
CTGGAACCGG CGCGGTGGCT CGCCGCGCTG CTGGACGCGC TGCTCCCGCC GCTGCTGCAC
TACCTGTACG TGTACGGGGT GGCGTTCACC CCGCACGGCG AGAACGTGAT CTGCGTGTTC
GACGCCGGTG GGATCCCGCG GCGGATCGCG GTGAAGGACT TCGGCGCGGA CATCGACCTC
GTCGAGGGCG AGTTCCCCGA ACGGGCGGCG ACGGACGGCG GCGCCGGCGC GCTGTGCCGC
CACTGGCCGG GCCGGATGCT CGCCCACTCG GTGCTCTCGG CAGTGTTCGC CGGCCACTTC
CGCTACTTCT CGGTGATCGC CGCCGACCAT CTCGGCGTGC CCGAGGAGGA GTTCTGGTCG
CTGGTGCGCG GCGCCGTGGA GGACTACCAG GAGGCCCACC CGGAGTACGC CGAGCGGTTC
GCCGCGGTCG ACCTGCTGAC CCCGTCCTTC GAGCGGGTCT GCCTCAACCG GGAGCAGTTC
GCCGGTGCCG GCTTCCACGA CCGCTCCGGG CGGGACGCCC AGTTCGACGT CCTGCACGGC
ACGGTGGCGA ACCCGCTGAT GCTCGCCCCA CCGCGGCGGA GCCACGGGTG A
 
Protein sequence
MTTGLPRAAA GRWRAAGLAL LTRLIAELAY EELLVPRAEP ATPPRTPPPV GRGAPAPYRI 
ECGPVTYTFT ARRGTFGTWW PDPATLRRDG EPAWDPARFL LDTRQGLGWS GDVLTDVVRE
VTATQRADAE ILRTALPARA LADLSHLELE GHQTGHPCMI ANKGRLGFDA ADAARYAPES
RRPFRLRWVA AHAELARLVT GPGLGATGLR EAELSAATRA EFGAVLRAAL EPTAGADAAA
LAERYVWLPV HPWQWENVVA PMFAGPLATG QLVDLGEAPD RYLPLQSVRT VANIDTPGRR
DVKLALMIRN TLVWRGMSAA DATAGPAVSA WLVSLAHGDP VLRATGVVVL PEIAGATVAH
PAFDAVPDAP YRLHELLGVL WREPVASFLA PDERARTMAC LLTVGADGES LAAELVRRSG
LEPARWLAAL LDALLPPLLH YLYVYGVAFT PHGENVICVF DAGGIPRRIA VKDFGADIDL
VEGEFPERAA TDGGAGALCR HWPGRMLAHS VLSAVFAGHF RYFSVIAADH LGVPEEEFWS
LVRGAVEDYQ EAHPEYAERF AAVDLLTPSF ERVCLNREQF AGAGFHDRSG RDAQFDVLHG
TVANPLMLAP PRRSHG