Gene Franean1_6420 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_6420 
Symbol 
ID5674735 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp7795709 
End bp7797331 
Gene Length1623 bp 
Protein Length540 aa 
Translation table11 
GC content76% 
IMG OID641245268 
ProductAraC family transcriptional regulator 
Protein accessionYP_001510663 
Protein GI158318155 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG2169] Adenosine deaminase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.178925 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.589237 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCCCGCCA GTCCGCATCT CGACGCCGAG CTGTGCGCGC GGGTGGTGCG GTCCCGGGAC 
GCCCGGTTCG ACGGGTGGTT CTTCGTCGCG GTGACCTCCA CCGGCATCTA CTGCCGGCCG
AGCTGCCCGG CGCGGCCACC CAGGACCGAG AACATGCGGT TCCACCCGAC CGCCGCGTCC
GCCCAGCGCG CCGGGTTCCG GGCCTGCAAG CGCTGCCGCC CGGACGCGAG TCCCGGCTCA
CCGGAGTGGA ACCACCGCGC CGACGTCGTG GCGCGCGCGA TGCGGCTCAT CGTGGACGGC
GTCGTCGACC GTGAGGGTGT CCCCGGGCTT GCCGGTCAGC TCGGCTACAG CGCACGCCAG
GTGGAGCGGC ACCTGTCAGC CGAGCTCGGT GCCGGGCCGC TCGCGCTGGC CCGCGCCCAG
CGCGCCCAGA CCGCCCGGCT CCTGCTCGAG ACCACGGACC TGAGCATGGG CGACGTCGCG
ACCGCCGCCG GCTTCAACAG CATCCGCGCC TTCAACGACA CCATGCGCGA GGTCTTCGCC
GCCGCGCCGA GCGATCTGCG ACGCCGTCGC AACCCGGCGA GGCCCCCGTC CGTCGTGCCG
GTACCGAGGG CGGCTACCGG TGCCGTGACG TTGACCGTCC GGCTGCCGTT CCGGGCCCCG
CTGTATCCCG ACAACCTGTT CGGGCACCTC GTCGCGACGG CCGTTCCCGG GGTCGAGGAG
TGGCGGGACG GTGCCTACCG GCGCACGATG CGCACGCTGC ACGGGCACGC GATCGTCGCC
CTGCGGCCGC TGCCCGACCA CATCGGCTGC CGGCTCGCCC TCACCGACGT GCGCGACCTC
GCGCCGGTCA TCGGCCGCTG CCGCCGGCTG CTCGACCTCG ACGCGGACCC GATCGCCGTC
GACGGGCAGC TCGCCGCCGA CCCGGCGCTG GCGCCGCTGG TCGCGCGGGC ACCGGGCCGG
CGTGTTCCGC GCACCGTCGA CCCGGCCGAG CTCGCGGTGC GCGCAGTCCT CGGACAGCAG
GTCTCCGTCG CGGCGGCGCG GACCCACGCC GCGCGGCTCG TCACGGCCGT CGGCACGCCG
ATCCATGATC CGGAAGGCGG CCTCACCCAC CTCTGGCCAC AGATCGCGGA CCTCGCCGAG
CACATCGAGC GCACCGAGTA CGCCGAGTGC ACCGACCTCG CGGACGCTGT CCCGGCCGGC
CGCCGGGCCG GGGCGCCGCG CGGGCTCGCC CTGCCGGCCG CCCGGCGGCG GACCTTCGCC
GCGCTGGTCG GCGGGCTGGT GTCCGGCATG ATCGAGCTGG GTGCGGGCGG AGACTGGGAG
CGGGCCCGCG CCGCGCTGGC GGCTCTGCCC GGCATCGGCC CGTGGACGCT CGAGACCATC
GCGATGCGGG CCCTCGGTGA CCCGGACGCA TTCCTGCCCG GTGATCTCGG TGTCCGCCGA
GGGGCCGAAC GGCTCGGTCT GCCGGCCACC CCCGCCGCGC TGTCCCGGCA TGCCGCCGCC
TGGCGCCCCT GGCGGGCCTA TGCCGTCCAG CACCTGTGGG CGGTGCTCGA CCATCCAGTC
AACCGGATGC CCGCGCCGGA TCATCCCGGC CCGGTCACCG CCCGCCGAGA GGAACGCCTG
TGA
 
Protein sequence
MPASPHLDAE LCARVVRSRD ARFDGWFFVA VTSTGIYCRP SCPARPPRTE NMRFHPTAAS 
AQRAGFRACK RCRPDASPGS PEWNHRADVV ARAMRLIVDG VVDREGVPGL AGQLGYSARQ
VERHLSAELG AGPLALARAQ RAQTARLLLE TTDLSMGDVA TAAGFNSIRA FNDTMREVFA
AAPSDLRRRR NPARPPSVVP VPRAATGAVT LTVRLPFRAP LYPDNLFGHL VATAVPGVEE
WRDGAYRRTM RTLHGHAIVA LRPLPDHIGC RLALTDVRDL APVIGRCRRL LDLDADPIAV
DGQLAADPAL APLVARAPGR RVPRTVDPAE LAVRAVLGQQ VSVAAARTHA ARLVTAVGTP
IHDPEGGLTH LWPQIADLAE HIERTEYAEC TDLADAVPAG RRAGAPRGLA LPAARRRTFA
ALVGGLVSGM IELGAGGDWE RARAALAALP GIGPWTLETI AMRALGDPDA FLPGDLGVRR
GAERLGLPAT PAALSRHAAA WRPWRAYAVQ HLWAVLDHPV NRMPAPDHPG PVTARREERL