Gene Franean1_6225 details


Gene Information

Locus tag: Franean1_6225
Symbol:
ID: 5674544
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 7560592
End bp: 7562733
Gene Length: 2142 bp
Protein Length: 713 aa
Translation table: 11
GC content: 70%
IMG OID: 641245077
Product: putative PAS/PAC sensor protein
Protein accession: YP_001510473
Protein GI: 158317965
COG category: [K] Transcription; [T] Signal transduction mechanisms
COG ID: [COG2208] Serine phosphatase RsbU, regulator of sigma subunit
TIGRFAM ID: [TIGR00229] PAS domain S-box
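
The coordinate and length fields above are mutually consistent, which can be checked with a minimal sketch. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(7560592, 7562733)
print(length)                  # 2142
print(protein_length(length))  # 713
```

Both values agree with the Gene Length and Protein Length fields of the record.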


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGGCGGAT CTAGCCGGCA CGGCCGCGAG AACTCGGCCG CCGCTCTGGA TCCACCCGAC 
GGAACTGACC CGGGCGCTCC GCCCGGGTCC GAGACCGGGC TGTTCACCTC GCAGCTGTTC
GAGCTGGCCC CGATCGGCCT GCTGATCACT GACCGGAAGC TGCGGGTACG CCAGATGAAC
CGCGCGATGC GAGCCATCAA TGGGTTGTCC GCCAGCGACA TCGGGCAACG ACCGTTGAGC
GAGTTGCTCC CCGACGTCCA CCCCGGGGCC TGGGAGGTGA TCCGTCAGGT CCTTGCCACC
GGCGAGCCCA TCCTCAACGT GGAGATCACC GGTTCCACTC CTGCCCCCTG GCCAGGGCAG
CGGACTTGGC GGGTGTCCCA CTATCCGCTC CGCGGCGGGG ACGGGTCCGT CGCAGCGGTG
GGCGTCGCCC TGGTGGACGT GACCGACCAG CGGCGGGCGG AGGCCGCGCG GGACAAGGTG
GAACAGCGGT TGCGGCTGCT GGGCCGGGCC AGTGGGTTGA TCGGAGCGTC ACTGGACCTG
ACGGCGACGC TGGAGGGGAT GGTCGAGCTC ATCGTTCCCG AGTTCGCCGA CAACTGCGAC
CTGTACCTCA CCGAGGAGCC GCTGGACGCG ACCGCCCCGC CGGCCCGGCT GCCGCTGCGT
CTCACCGTCG CCGGGAACAC GCCGTTGCTG CCGCCGCCGG AGGCCCACCA GCTCCTGCCT
TCCGCCATCA CCTATGTCGA CCAGGACAAC CCAGCGCACC GTTCGCTCGC CACCCGCCGT
CCGGTCCTCT TCGACGTCGA CGAGCGGGTG ATCGCCAGCG TCGATCATCC CCAGCGGGAT
GAGCACTTCG AATACATCGC CGTTCGCACC GCGATCACTG TCCCGCTGCT GGTCGGCAAC
GAATACCACG GTTCGGTGTA CCTCGGTCTC GGGCAGTCCG GGCGGACCTA CATCGAGTAC
GACGTGCAGA CCGCGGCGGA GCTCGGCAGC CGGATCGCCA ATGCGGTCGC CAACGCTCGC
GCCTTCGACC GCCAGCGCAC GGCCGCAATC ACCCTGCAAC GCGGGCTGCT GCCCGGCGGG
TTGCCGGCTG TCGAGGGTCT GGACATCGAA TGGCGCTACG AGCCCGGCAC GGCAGGAACC
GAGGTCGGCG GCGACTGGTT CGACATCGTG CCGCTGTCGG CCGGCCGGGT CGCCCTCGTC
ATCGGCGACG TGATGGGCCG CGGTCTGGCC GCCGCCGCCG TGATGGGCCA GGTCCGCGCC
GCGGTCCGTG CCTTCGCCGC CCTCGACCTC CCGGCAGCGG ACGTCCTGAC GCACCTCGAC
AGCCTGGTGC AGAATATCGG TGCCGGGCCG GACGGTGCCC TCGTCAGCTG CGTCTACGCA
ATCTTCGAGC CGGCAACCGC CAGCATCACC GTCGCCAACG CGGGTCACCT CCCGCCCGCC
CTCGTCGATG CGTACACAGA CGCTCGACTG CTGGAGGAGC CGGAAGGAAT CATCCTCGGC
ATTGGCGGCC CACCGTGCAC CGAAGTTCGG TACCCGTTCC CGGCAGGCAG CACCCTCGCC
CTCTACACCG ACGGACTCAT CGAATCCCCG AAGATTGACA TCGGCCAGGG TGTCCGCCAG
CTGCAGGCGG CCCTCGTCGC CACCGGCAGC TTGCCCGCCA CGGCCGAGCG GCTGCTCACC
CTCATCGACC GCAGCGGCGG TTACGACGAC GACGTCGCCC TCCTCCTCGT CCGCGCCACC
GCACGCGCCA CCACCTGGAC AACCACCGTG GAGCCCGATC CGCGGGCCGC CAAAGCCGCC
CGCGACACCA CCGTCACCGC GTTGCGGCAG TGGGAGCTCA CCGACAGCGT CGACCTTGTG
GAGCTCCTCG TCAGCGAGCT CGTCACGAAC GCCATCCGGT ACGCGAAGAC CCCCAGCGAC
CTGACGCTAC GCCGCGGTCG CCACGCCCTC TACGTCGAAA TCGCCGATGG CGACAGCCGG
GTGCCGCGCC TGCTGAACCC TTCCGCCGAT GACGAGGGCG GTCGCGGGCT TCAGCTTGTC
GCCCAACTCG CCACCCAGTG GGGAGCTCGT CCCACCCGCA CCGGGAAGAC AGTCTGGTTC
GAACTCGACC TGACCGGCGC GGATATGAAA CGGTCAGAAT AA
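
The GC content field can in principle be recomputed from the gene sequence above. The following is a minimal sketch, assuming the sequence is pasted as text with the 10-base blocks separated by whitespace. The helper name `gc_fraction` is hypothetical, and the example runs on the first block only, not the whole gene.

```python
def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases, ignoring whitespace between blocks."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First 10-base block of the gene sequence: 7 of 10 bases are G or C.
print(gc_fraction("GTGGGCGGAT"))  # 0.7
```

Applying the same function to the full 2142 bp sequence would give the value to compare with the reported GC content.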
 
Protein sequence
MGGSSRHGRE NSAAALDPPD GTDPGAPPGS ETGLFTSQLF ELAPIGLLIT DRKLRVRQMN 
RAMRAINGLS ASDIGQRPLS ELLPDVHPGA WEVIRQVLAT GEPILNVEIT GSTPAPWPGQ
RTWRVSHYPL RGGDGSVAAV GVALVDVTDQ RRAEAARDKV EQRLRLLGRA SGLIGASLDL
TATLEGMVEL IVPEFADNCD LYLTEEPLDA TAPPARLPLR LTVAGNTPLL PPPEAHQLLP
SAITYVDQDN PAHRSLATRR PVLFDVDERV IASVDHPQRD EHFEYIAVRT AITVPLLVGN
EYHGSVYLGL GQSGRTYIEY DVQTAAELGS RIANAVANAR AFDRQRTAAI TLQRGLLPGG
LPAVEGLDIE WRYEPGTAGT EVGGDWFDIV PLSAGRVALV IGDVMGRGLA AAAVMGQVRA
AVRAFAALDL PAADVLTHLD SLVQNIGAGP DGALVSCVYA IFEPATASIT VANAGHLPPA
LVDAYTDARL LEEPEGIILG IGGPPCTEVR YPFPAGSTLA LYTDGLIESP KIDIGQGVRQ
LQAALVATGS LPATAERLLT LIDRSGGYDD DVALLLVRAT ARATTWTTTV EPDPRAAKAA
RDTTVTALRQ WELTDSVDLV ELLVSELVTN AIRYAKTPSD LTLRRGRHAL YVEIADGDSR
VPRLLNPSAD DEGGRGLQLV AQLATQWGAR PTRTGKTVWF ELDLTGADMK RSE
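
The protein sequence starts with M although the gene starts with GTG, because an alternative start codon is read as methionine at the initiation position. A minimal sketch of that translation step follows. It assumes the standard codon assignments (which translation table 11 shares for elongation) and treats only ATG, GTG and TTG as start codons, which is a simplification of the full table 11 initiator set. All names are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order for codons enumerated over T, C, A, G.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Assumption: only the most common bacterial start codons are handled.
COMMON_STARTS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, reading a start codon as M and stopping at a stop."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in COMMON_STARTS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break  # stop codon ends the protein
        protein.append(aa)
    return "".join(protein)


first_60 = "GTGGGCGGAT CTAGCCGGCA CGGCCGCGAG AACTCGGCCG CCGCTCTGGA TCCACCCGAC"
print(translate_cds(first_60))  # MGGSSRHGRENSAAALDPPD
```

The output matches the first 20 residues of the protein sequence listed above.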