Gene Franean1_4619 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4619 
Symbol 
ID5672964 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp5508728 
End bp5509957 
Gene Length1230 bp 
Protein Length409 aa 
Translation table11 
GC content69% 
IMG OID641243480 
Productputative PAS/PAC sensor protein 
Protein accessionYP_001508896 
Protein GI158316388 
COG category[K] Transcription
[T] Signal transduction mechanisms 
COG ID[COG2208] Serine phosphatase RsbU, regulator of sigma subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGGAGAGC CGAGGATCGA CTACGAGGCC GTTTTTCGGG CTCTCCCCAG CCCGACCTTG 
CTACTGACCC CTGAATTGAT CATACTTGCC GCCAACGAGA CATATCTTCA GGTGTCGGGG
CGCACACGCG AGAACCTGCT GGGACGCTAC CTGTTCGACG CTTTCCCGGA CAATCCGGAG
TACCGGTCCG CGTCAGCGGT GCGCGCACTT GGCGCGTCGC TACGGCGAGT GCTGGCCACC
GGGGAGCGCG ACACCATGGC GGTCCAGCGT TACGACGTGG AGGCACCAGC GCGTCCAGAT
GTGTTCGAGG AGCGGTACTG GAGCACGGTC AACACTCCGG TGCTTGATCC TGACGGGCAG
GTGACGTTGG TGGTGCACCG GGTGGAGGAG GCGACCGACC TCGTCCGCAT GTGCGCCGGA
GAAATGGGGG ACGACCGGCA GCACAGGCTG GAGGTGGAGC TGATGGCCCG TGCTCGGGAG
CTGCACGAAG GCAACGAGCA GCTGCGTCGA ACTCACGCTC GGGAACGCGA GGTGGCCCTG
GCACTCCAGG AAGCGATGCT GCCCACGCCC GCACCGACCG GGCACGTCAA GGCGGCAGTG
CGGTACCAGC CCGCCGCCAG CACGATGAAC GTGTGCGGTG ACTGGTACGA CCTGGTGGAG
CTATCCGAGG ACCGCGTCGC GGTGGCCGTG GGCGACGTCG TCGGCCACGG ACTGTCGGCC
ACGGGCACGA TGGGCCAACT GCGCAGCGCA CTGAGCGCCA TGGTCCGAGT GGCCGACGGA
CCCGCAGCCG CCCTGGACGT GCTGGACCTG TACGCACGGT CGGTAGAAGG CGCCGAGTCG
ACCACCGTCG TGCAGGCGGT CGCCGACTAC GACACGCTCA CCGTCACCTA CAGCCGGGCA
GGTCATCCGC CGCCGGCACT GGCGCACGTC GGCGGCGCCA TCGAGTTCCT CGACCAGACC
GTCGACCCGC CGCTGGGCGC GAGCCCGGAA CACCTGCCCC GGTCCCAGGC CACCGCGACG
TTCGCGGTCG GCGCCACGCT GACGCTGTAT ACCGACGGCC TGATCGAACG CCGCGGGGAG
AACATCGACG TCGGGTTGTC CCGTCTTGCC GCCAGCCTCC GCCGCAACCC GGGACTTGAT
CCGGAGGCAC TGGCCGATGC GCTGCTGGCC GAGGTGGGCG CGGACAGCGA GCCAGCCGAC
GACACCGCCG TCGTCGTCGT CCGACTCTGA
 
Protein sequence
MGEPRIDYEA VFRALPSPTL LLTPELIILA ANETYLQVSG RTRENLLGRY LFDAFPDNPE 
YRSASAVRAL GASLRRVLAT GERDTMAVQR YDVEAPARPD VFEERYWSTV NTPVLDPDGQ
VTLVVHRVEE ATDLVRMCAG EMGDDRQHRL EVELMARARE LHEGNEQLRR THAREREVAL
ALQEAMLPTP APTGHVKAAV RYQPAASTMN VCGDWYDLVE LSEDRVAVAV GDVVGHGLSA
TGTMGQLRSA LSAMVRVADG PAAALDVLDL YARSVEGAES TTVVQAVADY DTLTVTYSRA
GHPPPALAHV GGAIEFLDQT VDPPLGASPE HLPRSQATAT FAVGATLTLY TDGLIERRGE
NIDVGLSRLA ASLRRNPGLD PEALADALLA EVGADSEPAD DTAVVVVRL