Gene Franean1_6792 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_6792 
Symbol 
ID5675105 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp8274330 
End bp8276087 
Gene Length1758 bp 
Protein Length585 aa 
Translation table11 
GC content71% 
IMG OID641245641 
Productprotein serine phosphatase with GAF(s) sensor(s) 
Protein accessionYP_001511032 
Protein GI158318524 
COG category[K] Transcription
[T] Signal transduction mechanisms 
COG ID[COG2208] Serine phosphatase RsbU, regulator of sigma subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCCCCTGG ATGCCGCGGA CGCCGGCGAG GAGTACCTGG CGGCGTTGGA GGAGATCGAG 
AGCTACGCCC GTGCCGCCCG CCTGCTCACC CTGGAATCGC CGCCGCAGCA GGTGGCTTTC
CGTCGCTGGT ACGTGGGATC GCTGGTGACA CAGCTCCGGA TCGCTGCCCG GGGCGGCGTG
CCCGAGCCGG TGCGTACCTT CGAGCAGTAT CTGCTGGACA CTCTCGACGA GGTGGTCAGA
ACCCAACGCA CGGCCGAACG GGCCGCGCGG CTGCAGGGGG TCACCGCGGC CCTCGCTCTG
GCGACGACGC CGGAGAAAGT CGCTGAGGTG GTGGTTTCCG AAGGGGTGAC CGCCCTGGGC
GCGTTCGGGG GCGGCCTGCT GGTCGCGCGG GACGCTGGGC GGCTCGCGGT CCCCGGCACC
GTCGGGTACC GCGAGGAGGT GGTCTCGATG CTGCGTGCGG AACGGCTCGA CGACCGCCTG
CCGGCCGCCG ATGCGATCCG CGGCGGTATG CCGGTCTGGT TGGAATCCCG CCAGGACCGC
GACTCGCGCT ACCCGGAGCT GGCCGAGCGG GAACCGAATG TGGTGTCGAT GTGTGCGTTG
CCGCTGCTCG TCGGTGACCG TGTGCTCGGG GCGTTGCGGT TCTCCTTCGA CCACCCCCGG
CTTTTCGACG CTGACGAACG TGACTTTGTC CTGGCGCTGG CGGCGCAGAC CGCGCTGGCC
GTGGAACGGG CCCGGCTGTT CGCCGCCGAG CGCCGTGCGC TCGAACGCAG CGCTTTCCTC
GCGACGCTCG CCGACCGGAT CACCACCACG CTGGACCCAC GGCGCGTCCT CGAGCAGACC
ACCGACCTGT TGGTGCCCCG GTACGCCGAA CGGGCCGTCG CGGTCCTGAC GGACGAGCCG
ATGACCTCCA TCGGCGATCC CTCCCCGAGG GAACTTGCCC TGATGAGGGC GGTTACTCAG
ACCGGAACCA CGACCACCAC CACCACCACC GACGGCGACG GCGACGAGAC CGTACTGGTT
GTTCCGCTCA CCGTTGGCGG CCGTACGGTC GCCGTCCTCG CTGTGCTCTG TCACGAGGAC
AGCCCACGCG CCCGCGACGA GCAGGATGCC ATCGAGGAGG TCGCCCGGCG TGCGGCCGTC
GCGGTGGGCA ACGCTCAGCT GTACGAACAG GAACGGCGCA CCGCTCTCAC CCTGCAACGC
AGCCTTCTCC CGCAGCGACT GCCGACGATC GGTGGGCTGT CGTTCGCCTG GCGCTACCTT
CCTGGCAGCG CCGGCGCCCT GGTCGGTGGC GACTGGTACG ACGTCCTTCC CCTGGACGAC
GGTCGGGTGG CGCTTGTCAT CGGGGACGTC ATGGGGCACG GCATCCAGGC CGCCGCGACC
GTGGGCCAGC TGCGGGCCTC CGCCCGCGCA CACATCACCA CCGATCCCAG TCCGTCGGCC
GTACTGGCCC GGCTCGATGA GGCAGCGAAT CGTCTCGAGC AGGGACAGAT CGCCACGGCG
GCGCTTGCCG TGCTGGACCC CGCGAACGCG CGGCTCACCC TGGCATCCGC CGGGCACCTG
CCACCGCGGC AGCTTCCCCC GACCAGCTGT GCGACCAGGC CCTGGCCGTA CTCGGCCGGC
TGGCCGGCCA CGACGACGAC ACCGCCCTGC TCGCCGTCAT GGTCAACACC CTGGGCCCCG
CATGATCTCG CCGAGCTGGA GGTCATTGGC CCAACCGGCG ATCTCAGCGT GCTCGCGGTA
GTGGCGGACT CGCGGTAG
 
Protein sequence
MPLDAADAGE EYLAALEEIE SYARAARLLT LESPPQQVAF RRWYVGSLVT QLRIAARGGV 
PEPVRTFEQY LLDTLDEVVR TQRTAERAAR LQGVTAALAL ATTPEKVAEV VVSEGVTALG
AFGGGLLVAR DAGRLAVPGT VGYREEVVSM LRAERLDDRL PAADAIRGGM PVWLESRQDR
DSRYPELAER EPNVVSMCAL PLLVGDRVLG ALRFSFDHPR LFDADERDFV LALAAQTALA
VERARLFAAE RRALERSAFL ATLADRITTT LDPRRVLEQT TDLLVPRYAE RAVAVLTDEP
MTSIGDPSPR ELALMRAVTQ TGTTTTTTTT DGDGDETVLV VPLTVGGRTV AVLAVLCHED
SPRARDEQDA IEEVARRAAV AVGNAQLYEQ ERRTALTLQR SLLPQRLPTI GGLSFAWRYL
PGSAGALVGG DWYDVLPLDD GRVALVIGDV MGHGIQAAAT VGQLRASARA HITTDPSPSA
VLARLDEAAN RLEQGQIATA ALAVLDPANA RLTLASAGHL PPRQLPPTSC ATRPWPYSAG
WPATTTTPPC SPSWSTPWAP HDLAELEVIG PTGDLSVLAV VADSR