Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_3551 |
Symbol | |
ID | 5671920 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 4210608 |
End bp | 4213613 |
Gene Length | 3006 bp |
Protein Length | 1001 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 641242437 |
Product | cyclic nucleotide-binding protein |
Protein accession | YP_001507857 |
Protein GI | 158315349 |
COG category | [K] Transcription [T] Signal transduction mechanisms |
COG ID | [COG2197] Response regulator containing a CheY-like receiver domain and an HTH DNA-binding domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCGTAGGG AAAACACCGG CGGTCAAGAA CTGGACTTCC TGAACCGGGT TGGCTCGCTG GCCCTGGACA CCCGAGAATC CGACGGCGGC GCGGGGCTGG CGGCGCCCGA CGAGGACCGC GCCGTCAGCA ACCAGCAGTG GCCGATGGCG GCTCGCGCCT CCGAGCTCGC CGCGGTGGTG GGCGCCGTCC GGCGGGGAAC CAGCGTCGTT CTCGTCGGCG AGCCCGGGGT GGGCAAGACC CGGCTGCTGC GGACCGCCCT GGCGGAGGTG GCGAACGCCG GTGATCCGAT CCGGCTGGTC GACGCGCCGC ACGGGACCCA TCCCGACCTG CCGGGCTCGC TGCTGGCGGA GCTCTCGGCG GCCTTCGACC TGCTGGGCCG GGACCCGCTC GAGTGCGGTC CCGCCACGGT CATCCGGCCG CGCGACGGGC AGCACCCGGC CGGGGCCGCG GATGTCTGCC CGCAACGGAT GTTCGGCTCG GCGGGCCTGG GCGCGACGCG TCCCGACCAC GGGCTCTCCG GCGTCCGGCG CGGCCCGGAG GCGGGAGGTG GGAACAGTGC GGGCGCGCGC CGCGCGCCGG CGGCTGGCGC CGAGGGACGG GTGGTGCTCG GTGTCGACGA CGCTCATCTC CTCGATTCCG AGCCGGCGGC GATGCTGCAC CACCTGGTGG CCACCGGCCG GGTCACGCTC GTCGCCGCCG TCCGCTCCGG CGAGGACCAG CACCGGGGCG TCAGCAGGCT CTGGATGGAG CGGCTCGCCG AGCGTTTCGA CCTGGCCGAG TTCGACGAGA GCGGTGTGCA CGCGATGGTG CGGGCCCGGC TCGGCGGCTC CGTCGACGAG TCGACGCTGG CCAGGCTGCA CCATCTGACC CGGGGCAACG CCCTCTACCT GTGTGAGCTG GTGGGTCACG CCTTGGCCGA GGGAACCTTC GTCCAGGCGG ACGGGATCTG GCGGTGGTCC GGTCTCGCGA GCAGCGGTGG CCGGCTCGCC GACCTGGTCC GGCTCCGGCT GGCCGACCTC GAGCCGGACG AGGTCGAGCT GGTCGCCATG GTGGCGCTCG CAGAGCCCCT CGAGGCCGAC CTACCCGTCG TGGCGGAGCT CGCCGCCGCG GCCGAGTCGC TCAACCGGCG CCGGATCATC GTGGCCGAAG GGGTCGGCCG CCGCGTCCAG CTGCGCCTGT TCTATCCGCT GCACAGTGAG GTGCTGGTCG CCTCCCTGCC GGAGCTGACC GCCCGGCGGC TGCGCGCCCG GCTGGCCGCG GCGATCGAAC GCACGGGGCT GCGCCGGCGC ACCGACCTGT TACGGGCCGT CCGGCTGCGT CTTGACGCGG GTCAGATGCC CGCGCCGGCG CACCTGATCG ACGCGGCGGA ACGGGCGGTG GGCACCGGCG ACGTCGTGCT CGCCGAACGG CTGTGCCGGC TCGCGATCGC GGTGCGGGAG CCGACGCTCT CCCGCAGCCG CATCGAGCTG CTGCTGGGCC GGGCCCTGTG CTCGCAGGGC AGGCATTCGG ACGCCGAGGA CGTCTTCGGG CGGGGCTGTG ACCACGCCCC TCGGGAGGAG CTGGCCGTCC TGGCCCGCAC CCGGGCCCTG AACATGGCCG GTGGGCTGGG GCGGATCGAC GACGCCGAGG CGTTGCTCGC CGCCGTCCGC TCCGCGGTGC CCGCGGCGGA CGCGGCGAAG CTGTCGGCCA CCCAGGCGGT CGTCTGGATG CTCGCGGATC GGCTGCCGGA GGCGCTGACG CTCGTGGGGT CCGCCCTCGC GGGGGAGAGC GCGGACTCAC CGCTGAGGCG CGAGGCCGTC CCGGTCATGG CGGTCGCGCG CACCGAGCTC GGCGACGCGG CCGGTGCCCT CGAGCTGCTG GACAGCTGCC TGCCGGCACT CGACCGGTGG GCTGATCACC ACTGGCTGCC GCACCGGCTG GCGACGGTGG CCGCCAGCGT CGCGCTCGGA CGGATGGGCG ACGCTTCGGC GACGCTGCGG CGGGTGCGCC GCCGGCTGGC CGACGGCCTG TCCCGGGTGA TGTGGGATCC GCTGACCGTG CTCGTCGAGG CCCACCACCT GCGGCTGACC GGCCAGAGCG CCGAGGCTCT CGACCTGCTG CGCCGCTCCG ACGACTCCGA CGCCGCCGCG AGCATCCCGG ACATCCGTTG CTGGACGCGT GCCCAGGTGG CGGGTGCCCT CGCGGAGTCC GGCTCGCACG CGGACGCGCT GATGGCGATC GCCGAGGCCC GCGCGCTGGT GGCCGCGACC GGCGGCTCCG CCACCGCCCG GGGGTGGGTG ACGATCGAGG AGGTGACCGT GCACGCGCAT GCCGGTGACC GGGCGCGGGC CGTCTCTCTC GCGCTGGAAC TGGCCGACCA CTTCGTGGCC GGCGGGCGGA TCGTCCGCGC CGTCGAGGCG CTGCACCTCG CCGCCCGGCT GGGCGTCGCG AACGCCGTAG TGGCCCGATG CGAGGCGCTG GCGGCCCGGA TCGGGACGGC CGACGTCGCG CAGGTTCGGT CCGCTCATGT CCGCGCCCTG GCCGGCGCCG ACGGTGACGC GCTGAGCGGC GTCTCCCGCC GCTTCGAGGA CATGTGCCTG CTGCCGCTGG CCGCGGAGAC CGCCGTCCAG GCGGCCGCCG CGTATCAGAT GCGCGGTGCG GTCCGGAGAG GGCGGCTGGC TAGAGTCCGC GGCGCCGATC TTGTCTCCCG GTACGGGGGC CGGCTGCCGC CCTGGGCGTC GGACGAGGTC AGCGTCGCCG CCGGTGCCGC GGCGACGCGG GGCGGGGTGA GAGCGCACGG GGTTCCCGTC CCGGAGCTCA CCCCGAGGGA ACGCGAGGTC GCGGCGTTCG CGGCCGTTGG GCTGTCGAAC CGGGAGATCG CGTCCCGCCT GGTGGTGTCC GTGCGGACCG TCGAGAACCA CCTGCAACGC GCGTACGGCA AGCTCGGAGT GCTGCGTCGC GCCGATCTGG CGCTCCGGCT GCAGGAGAGC CTGGAATCGG GCTTCGCGTC GGGGTCGGCG ACCTGA
|
Protein sequence | MRRENTGGQE LDFLNRVGSL ALDTRESDGG AGLAAPDEDR AVSNQQWPMA ARASELAAVV GAVRRGTSVV LVGEPGVGKT RLLRTALAEV ANAGDPIRLV DAPHGTHPDL PGSLLAELSA AFDLLGRDPL ECGPATVIRP RDGQHPAGAA DVCPQRMFGS AGLGATRPDH GLSGVRRGPE AGGGNSAGAR RAPAAGAEGR VVLGVDDAHL LDSEPAAMLH HLVATGRVTL VAAVRSGEDQ HRGVSRLWME RLAERFDLAE FDESGVHAMV RARLGGSVDE STLARLHHLT RGNALYLCEL VGHALAEGTF VQADGIWRWS GLASSGGRLA DLVRLRLADL EPDEVELVAM VALAEPLEAD LPVVAELAAA AESLNRRRII VAEGVGRRVQ LRLFYPLHSE VLVASLPELT ARRLRARLAA AIERTGLRRR TDLLRAVRLR LDAGQMPAPA HLIDAAERAV GTGDVVLAER LCRLAIAVRE PTLSRSRIEL LLGRALCSQG RHSDAEDVFG RGCDHAPREE LAVLARTRAL NMAGGLGRID DAEALLAAVR SAVPAADAAK LSATQAVVWM LADRLPEALT LVGSALAGES ADSPLRREAV PVMAVARTEL GDAAGALELL DSCLPALDRW ADHHWLPHRL ATVAASVALG RMGDASATLR RVRRRLADGL SRVMWDPLTV LVEAHHLRLT GQSAEALDLL RRSDDSDAAA SIPDIRCWTR AQVAGALAES GSHADALMAI AEARALVAAT GGSATARGWV TIEEVTVHAH AGDRARAVSL ALELADHFVA GGRIVRAVEA LHLAARLGVA NAVVARCEAL AARIGTADVA QVRSAHVRAL AGADGDALSG VSRRFEDMCL LPLAAETAVQ AAAAYQMRGA VRRGRLARVR GADLVSRYGG RLPPWASDEV SVAAGAAATR GGVRAHGVPV PELTPREREV AAFAAVGLSN REIASRLVVS VRTVENHLQR AYGKLGVLRR ADLALRLQES LESGFASGSA T
|
| |