Gene Franean1_4508 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4508 
Symbol 
ID5672857 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp5378942 
End bp5380561 
Gene Length1620 bp 
Protein Length539 aa 
Translation table11 
GC content64% 
IMG OID641243373 
Productdiguanylate cyclase with PAS/PAC sensor 
Protein accessionYP_001508789 
Protein GI158316281 
COG category[T] Signal transduction mechanisms 
COG ID[COG2199] FOG: GGDEF domain 
TIGRFAM ID[TIGR00229] PAS domain S-box
[TIGR00254] diguanylate cyclase (GGDEF) domain 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTGCGACA TTCTAGGCCG GTCGAGATTC GATCTTATCG GCGGTACCTG GGACGCTCTG 
GAGCCTCCCC CTGACGGTAA CGGCTGGGAG GATGTGTCGC AGCTGCTGAA CGGCGAACGG
ATTGTGTTGC GGCATCTGAC CCGGTTCGCC CGCCTCGATG GCAGCGTGAT TCATATCGTG
ATAACGACCC GAGTTGAGAA GTACGCAGAC GAGCCCTGCT GTCTCGGCCA GATACTAGAC
GTGACCGACG CTGCCGTTAC TCGCGACCAA TTGCAGCTGA TATTGGAAAA CAGTCCTGTT
TCCATGGGGC TTACAGATCT ATCCGGCCGG ATCTTAGTAT CAGGTGGCGG CTCCGTTCCT
GCCCTGGCCG AGGACCTGGA GGCTGCTGGG CGGACGTCCA TCTTCGAGGT TTTCAAGCAC
ACCCCGGATA CGCTGACAAT GATGCGACGA GCGATGGACG GCGAGCCGTC CATCGGAATG
ATCCAGGCCT ACGGACACTC CTACGACCTC CACATTAGGC CGATCCGTAA TCCGGCCGGG
CAGGTCAGCC ACGTTGCTTC GATCGCCACA GACGTGACGG AGCGCGAACG AGCACATGAT
GAGCAGAGCA CCGTCACCGA ACTGGCTCAT CAGGCGTTGC AGATCGTGGA GCCGGAACGG
TTGTGGAACC ACGCCGCCGC CGTCCTAGCC AGCCGGCTGG ACGCGGCCGT CACCGTGCAT
ATGACCGAAC CAGGACACGG ACCTGACCTG ATCGCCGCGG TTGGAGTGCT GCCACCGGCT
CCCGTCACCG CTGCGGCTGT CCGGGACGCC CTCCTAATCG GGCCGCCGGA CCAGGGCAAC
GTAACAGGCG CGCGGAACCT CCGCCGGACG GGGCGCTGGT TAACTGTGTC CCTGCCGCTC
GGCCGACCCA ACGCGCCCGC CGCCATCCTC ACCATCCACC GGACGGACCG GGAGCTTCAC
GAAGACGACA GCCAGGAAAC GGCAGCCGAG CCGTTCACAC CTGGCGAGGT GGAATTCGTC
GATGCGGTCG CCAGTGTGCT TGGTGCCGCA GCGGTCCGGT TCGCGATGGA ACGCGAAGCC
AACTACCGTG CGTTGCACGA CAGCCTGACC GACCTCCCCA ACCGCGTCGC CCTGCTCGAC
CGGCTCAAGT GCAACCTCGA ACAAAGCCGG AGCGACGGTC TCCGTACCGG CGTCATATTC
ATCGATTTGG ACGGTTTTAA GATTGTAAAT GACAGCCTCG GTCATCTTGT CGGCGACGAC
GTCCTGCGTG AGGTAGCCGA CCGCCTGCGC GCCTCGGCCC GGCCGAATGA CGTCGTGGCC
CGGCTTGCCG GCGACGAGTT CGCGATCCTC TGCGAACAGG TCTCCACGAT CATGGAACTC
GAGCGAGCGG CGCACGGTGT CATCACCGCA CTTGCCAAGC CGATCGCGCT ACCGGAAGCC
GACGTTACAA TTACCGCCAG TGCCGGAGTG GCAATCTCGG GTACCGGCCT CGCCAATGCC
GATCGGCTCC TCAACGCCTC CGACGTCGCG ATGTACGTTG CGAAACGAGC CGGACCCGGC
CACTGCGTCG CTTACCAGCC GGCCATGCGC CTCGACCAGG CGACGTCACC GGCCCGGTAG
 
Protein sequence
MCDILGRSRF DLIGGTWDAL EPPPDGNGWE DVSQLLNGER IVLRHLTRFA RLDGSVIHIV 
ITTRVEKYAD EPCCLGQILD VTDAAVTRDQ LQLILENSPV SMGLTDLSGR ILVSGGGSVP
ALAEDLEAAG RTSIFEVFKH TPDTLTMMRR AMDGEPSIGM IQAYGHSYDL HIRPIRNPAG
QVSHVASIAT DVTERERAHD EQSTVTELAH QALQIVEPER LWNHAAAVLA SRLDAAVTVH
MTEPGHGPDL IAAVGVLPPA PVTAAAVRDA LLIGPPDQGN VTGARNLRRT GRWLTVSLPL
GRPNAPAAIL TIHRTDRELH EDDSQETAAE PFTPGEVEFV DAVASVLGAA AVRFAMEREA
NYRALHDSLT DLPNRVALLD RLKCNLEQSR SDGLRTGVIF IDLDGFKIVN DSLGHLVGDD
VLREVADRLR ASARPNDVVA RLAGDEFAIL CEQVSTIMEL ERAAHGVITA LAKPIALPEA
DVTITASAGV AISGTGLANA DRLLNASDVA MYVAKRAGPG HCVAYQPAMR LDQATSPAR