Gene Franean1_3883 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3883 
Symbol 
ID5675724 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp4619152 
End bp4620480 
Gene Length1329 bp 
Protein Length442 aa 
Translation table11 
GC content72% 
IMG OID641242763 
Productdiguanylate phosphodiesterase 
Protein accessionYP_001508180 
Protein GI158315672 
COG category[T] Signal transduction mechanisms 
COG ID[COG2200] FOG: EAL domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.385825 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00073239 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
GTGGACGCGT GGGAAACATG CTGGCGAGTG CCGCGTGGGG AGGTGCACGT GACCGACGCC 
TATGGTCGCG GCGGCGCCAC ACCGCCGTCC CGGTGGGATC CGGTCGAACC GGACCCGGTG
GCCGGGCTAC TCGCCACCGC CCGTCGCCGC CTGGGCATGG ATCTCTGTTG GTTCTCCAGG
CTGATCGGGG GAGCCCAGGT GATCGAGGCG TGCGACGGCG ACGCCGCGGC CTTCGGTGTC
CACCCCGGCT CGACGGTGCA CGATCCCGGC CCGCACCGCT CGAATGCTCT AGCCGAACGT
GCTCTAGCCG AACGCCCGCC TCCGGTCGTC CGCAGGATCC GCCCGGAGGC GGTGACGTCC
GAGGCAGCGG GGCCGGCGAT GACCGGGGGG CCGCTGGCCG ACCGGCCCGA GATCGGCTCG
TACATCGGAG TTCCGGTGAC CCTGGCGGAC GGCCGTCCGT ACGGGATGTT GTGCTGCCTG
AGCCGCGACG CCGACGTGGC GACGCCCGGC CGCAAGGCGC GTTCGCTGGC CCTGCTGGCG
GAGGTGCTGT CCGCCTCGAT CTCCGACCGG CGGTCCGGCG GGGAGGACCG GGAGGCGGCG
TGGTGGCGGA TCTGGCGGCT GATCGAAAGC GGCGGCCCGA CGATGGTCTT CCAGCCGGTC
TTCGACCTGC CGTCGCTCGA CTGCGTGGGG GCTGAGGCGC TTGCCCGGTT CCCGCCCGGT
TCGGGTGGTG CCGAGCGCTG GTTCGCCGAC GCCGCCGCCG TGGGTCTCGG CCCCGCACTC
GAACTGTCCG CGATCCGGTC GGCGCTGCGC GCGTTCACCC GCCTGCCGCC TGAATTCGGG
CTCGGGGTCA ACGCCTCACC GGCCACGATC CTCTCGGGGC GGCTGGCGGA TGCGATCGCC
GATATCCCGG CCGACCGACT CGTTGTCGAG GTCACCGAGG GTGACAAGGT CGAGGATTAT
CTGTCGGTCC GCTGCGCGCT GGGCGTCCTG CGCCGTGAAG GAGTCCGGAT CGCCGTGGAC
GACGTCGGTG CCGGCTACGC CAGCCTGCAT CATCTTGTGC AGCTCCAGCC CGACTTCATC
AAGATGGATC AGTGTCTCAC CCGGCGGATC GACGCTGATC CGGCGCGGCG CGCGCTCGCC
GCCGCGCTGG TGCACTTCGC CCAGGAGACA GGCAGCCTGG TCCTCGCCGA GGGGGTCGAG
ACCGCGCGGG AGCTCGGTGT CCTGATCGGC ACGGGCGTGC ACCAGGCGCA GGGCCACTAC
CTCGCCCCAC CCGGTCCGCT GCCACTGCCC GCGAGCGCCA ACCGGACCCG CCCGCCCGAT
ACGGCTTAG
 
Protein sequence
MDAWETCWRV PRGEVHVTDA YGRGGATPPS RWDPVEPDPV AGLLATARRR LGMDLCWFSR 
LIGGAQVIEA CDGDAAAFGV HPGSTVHDPG PHRSNALAER ALAERPPPVV RRIRPEAVTS
EAAGPAMTGG PLADRPEIGS YIGVPVTLAD GRPYGMLCCL SRDADVATPG RKARSLALLA
EVLSASISDR RSGGEDREAA WWRIWRLIES GGPTMVFQPV FDLPSLDCVG AEALARFPPG
SGGAERWFAD AAAVGLGPAL ELSAIRSALR AFTRLPPEFG LGVNASPATI LSGRLADAIA
DIPADRLVVE VTEGDKVEDY LSVRCALGVL RREGVRIAVD DVGAGYASLH HLVQLQPDFI
KMDQCLTRRI DADPARRALA AALVHFAQET GSLVLAEGVE TARELGVLIG TGVHQAQGHY
LAPPGPLPLP ASANRTRPPD TA