Gene Franean1_6994 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_6994 
Symbol 
ID5675305 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp8518911 
End bp8521304 
Gene Length2394 bp 
Protein Length797 aa 
Translation table11 
GC content74% 
IMG OID641245840 
Productputative PAS/PAC sensor protein 
Protein accessionYP_001511231 
Protein GI158318723 
COG category[K] Transcription
[T] Signal transduction mechanisms 
COG ID[COG2208] Serine phosphatase RsbU, regulator of sigma subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.126059 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCCCGGG CGTCGCAGAG TAGCGCCGAC CCCCGCGACC ACGCCGCGAC GGGGCTACTC 
CATGTGATCG CCCGTCAGCA GCGGGAGCTG GCGGAGCTCC GGGCGCGGTC CGCAGCCGAC
GCTCTGATCG ACATGGCTGT CGGGGTCCTG GCCGAACGGC TGTGCTGCTC GGTCACCGAG
GCCGGTGACC AGCTGCGCCG TCTCGCATCC GCCTCGGGGG TCACCCAGGA CGAGTTCGCC
GCCGAGGTGC TCGGCGGGAC CGCCGGAACA CTCCATCGAC CCCCGCTCGA CGACGGCGCC
CGGCGCCGCC TGTGGCTGGC CGAGGCCGCG GTCAGCCACG CCACCGATGG CTGCCCGGTC
GCCGAGGCCG TGTTCACCGA GGTCCTGGCC CATCAGGGTG CCGTCGCCGT GGCCTTCTGG
ATGCTGGAAT CCGATGGCGC GCTCACCCTG GTGGGCGAGA CCGGCCTGGG GCCGCTCGAG
GCGAGCCGGT GGCAACGCAT CCCGCCGCAG CTGGACTGCG TGGCGATGCG CGCGGCCCGC
GGCGCCACGC CGCAGTGGTG GCCCGAGGGC TCACCGCCCG AGGCGAACGA TCCGACCGAC
GTCCCGACCG ACATCCCGAC CGACATCCCG GCGGGCGGCC GGGCGGGCGT CGGCTCGGTG
CCGCGGGTGG GTCCCTGGCT CGGGGCGCGG GCGGCGCTGC CGTTCCTGCA GGCAGGCGCC
GTCGTGGGCG CCGTGGAGAT CTGCTGGCCC GCCCCGCTCG CGGAGTTCAC CGCCGGGCTC
CGACGCGAGC TCGTCGCGCT CGCGGACCTG TGCCTGCGCG GCCTGCACCC GGGCGGATTC
TCCGCCGACA CCCGCTCGGC GCAGCCACAG ACCCTCGGCG GGCGAACGGC CTGGCTGCCC
GCCCTCCTCG ACGCCGTCGC CGGGTCGGTG CTGCTGGCCC ACGCGGTGCG TGCCGAGGAC
GGCGAGATCC TCGACTTCCA CGTCGACCAC GTGAACGAGG GCTTCGTCGA CCCGGCCGGC
AGGCACCCGG CCCAGCTGCG CGGCCGGCGC CTGCTCGACC TCTACCCGCT CATGGCCGCC
GACGCCGGCC TGCTCGAGCG GGCCGTGGCG GTCGTGCGTA CCGGAGAGCG GTACCAGGCC
GACGGCATCG CCCTGCTCGT CCTGGTCGAC GACGTGCTCG TCGCCGAGGA GCTCGACGTC
CGCATCGCCC CCTTCTTCGA CGGAGTCGTG ATCAGTTGGC GGCCGGTCGG CGGCGGTGAC
GCGGGCGGTT CCCTGGCCGG CCACCTGCAA CGCCTCGGCC GGTTCGGCGG CTGGCAGGAG
GACGTCCGCA CCGGGGCCGT GCGCTGGACC GACCACGTCT ACGAGCTCTT CCAGCGTGAC
CGCACGATGT CACCCGTCCC GCTCGACGAC CTCGATCCGC ACATCCACCC GGACGACCGG
GCGGCCGTGG ACCACCTGCG TGACGCGCTG TTGCGGCTCG GCCGGGCTGC CTCGGGATCG
TTCCGGGTCA TCGGCAGCGA CGGGACGCTG CGCCAGCTCC GCGCCTTCGC CGAGCCGGTG
ACCGACGCCG CCGGCGTGAC GGTCGCCGTC CGCGGCATCT ACCAGGACCT GACCAGCCAC
TACCGCACGC AGGTCGTTCT TGACGCCACC CGGGACCAGC TCGCCGATTC GGAGGCGCGG
GCCGACGGCC AGCATCAGCT CGCGCTCGCC CTGCAGCGGG CGATCCTGCC CAGCTCCGAG
GAGCCCGTCG ACCTCGGCGG CCTCGCGGTC GCCGTCCGCT ACCGCCCCGC CGAGCAGGAG
CACCTTGTCG GTGGCGACTG GTATGACGCG GTCACCCTGC CGACCGGCGA GGTCCTGCTC
GTCGTCGGTG ACGTCGCCGG GCACAGCATC AGGACCGTGG CCGGTATGGT GACCCTGCGC
AACAGCCTGC GCGGCCTCGC GGTCACCGGC GCCGGCCCCG GGCAGCTGCT GCGCTGGCTC
AACAACGTCA CCTACCACCT CGCCGACAAC ATCACCGCCA CCGTCATCTG TGGCCTTTAC
CGTCCCGCCG AGCGTGCGCT GCGCTGGGCC CGCGCCGGGC ATCTGCCGCC CGTGCTCGTC
CGGGACGGCT CCGCCCACGC CCTTCCGATG CCCGACGGGC TCCTGCTCGG CGTCGAACCC
GATGTCGACT ACCAGGAAAC CGTCCACGTT CTTGAGCCGG ACGACGTCCT GCTTCTCTTC
ACCGACGGCC TCATCGAACG CCGCGGAACC GGGCTGGACG AATCCCTCCA CGCGCTGCTG
CGGATCGCGG CCGCCCCGGG CTTCGACGTC TCGCACCGGG CCGATCACCT GCTCGCGCAC
ACCACCCCGG ACACCGACGA CGACACCTGC CTCATCGTCA TCCGACAGTC CTGA
 
Protein sequence
MSRASQSSAD PRDHAATGLL HVIARQQREL AELRARSAAD ALIDMAVGVL AERLCCSVTE 
AGDQLRRLAS ASGVTQDEFA AEVLGGTAGT LHRPPLDDGA RRRLWLAEAA VSHATDGCPV
AEAVFTEVLA HQGAVAVAFW MLESDGALTL VGETGLGPLE ASRWQRIPPQ LDCVAMRAAR
GATPQWWPEG SPPEANDPTD VPTDIPTDIP AGGRAGVGSV PRVGPWLGAR AALPFLQAGA
VVGAVEICWP APLAEFTAGL RRELVALADL CLRGLHPGGF SADTRSAQPQ TLGGRTAWLP
ALLDAVAGSV LLAHAVRAED GEILDFHVDH VNEGFVDPAG RHPAQLRGRR LLDLYPLMAA
DAGLLERAVA VVRTGERYQA DGIALLVLVD DVLVAEELDV RIAPFFDGVV ISWRPVGGGD
AGGSLAGHLQ RLGRFGGWQE DVRTGAVRWT DHVYELFQRD RTMSPVPLDD LDPHIHPDDR
AAVDHLRDAL LRLGRAASGS FRVIGSDGTL RQLRAFAEPV TDAAGVTVAV RGIYQDLTSH
YRTQVVLDAT RDQLADSEAR ADGQHQLALA LQRAILPSSE EPVDLGGLAV AVRYRPAEQE
HLVGGDWYDA VTLPTGEVLL VVGDVAGHSI RTVAGMVTLR NSLRGLAVTG AGPGQLLRWL
NNVTYHLADN ITATVICGLY RPAERALRWA RAGHLPPVLV RDGSAHALPM PDGLLLGVEP
DVDYQETVHV LEPDDVLLLF TDGLIERRGT GLDESLHALL RIAAAPGFDV SHRADHLLAH
TTPDTDDDTC LIVIRQS