Gene Franean1_3551 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3551 
Symbol 
ID5671920 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp4210608 
End bp4213613 
Gene Length3006 bp 
Protein Length1001 aa 
Translation table11 
GC content75% 
IMG OID641242437 
Productcyclic nucleotide-binding protein 
Protein accessionYP_001507857 
Protein GI158315349 
COG category[K] Transcription
[T] Signal transduction mechanisms 
COG ID[COG2197] Response regulator containing a CheY-like receiver domain and an HTH DNA-binding domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCGTAGGG AAAACACCGG CGGTCAAGAA CTGGACTTCC TGAACCGGGT TGGCTCGCTG 
GCCCTGGACA CCCGAGAATC CGACGGCGGC GCGGGGCTGG CGGCGCCCGA CGAGGACCGC
GCCGTCAGCA ACCAGCAGTG GCCGATGGCG GCTCGCGCCT CCGAGCTCGC CGCGGTGGTG
GGCGCCGTCC GGCGGGGAAC CAGCGTCGTT CTCGTCGGCG AGCCCGGGGT GGGCAAGACC
CGGCTGCTGC GGACCGCCCT GGCGGAGGTG GCGAACGCCG GTGATCCGAT CCGGCTGGTC
GACGCGCCGC ACGGGACCCA TCCCGACCTG CCGGGCTCGC TGCTGGCGGA GCTCTCGGCG
GCCTTCGACC TGCTGGGCCG GGACCCGCTC GAGTGCGGTC CCGCCACGGT CATCCGGCCG
CGCGACGGGC AGCACCCGGC CGGGGCCGCG GATGTCTGCC CGCAACGGAT GTTCGGCTCG
GCGGGCCTGG GCGCGACGCG TCCCGACCAC GGGCTCTCCG GCGTCCGGCG CGGCCCGGAG
GCGGGAGGTG GGAACAGTGC GGGCGCGCGC CGCGCGCCGG CGGCTGGCGC CGAGGGACGG
GTGGTGCTCG GTGTCGACGA CGCTCATCTC CTCGATTCCG AGCCGGCGGC GATGCTGCAC
CACCTGGTGG CCACCGGCCG GGTCACGCTC GTCGCCGCCG TCCGCTCCGG CGAGGACCAG
CACCGGGGCG TCAGCAGGCT CTGGATGGAG CGGCTCGCCG AGCGTTTCGA CCTGGCCGAG
TTCGACGAGA GCGGTGTGCA CGCGATGGTG CGGGCCCGGC TCGGCGGCTC CGTCGACGAG
TCGACGCTGG CCAGGCTGCA CCATCTGACC CGGGGCAACG CCCTCTACCT GTGTGAGCTG
GTGGGTCACG CCTTGGCCGA GGGAACCTTC GTCCAGGCGG ACGGGATCTG GCGGTGGTCC
GGTCTCGCGA GCAGCGGTGG CCGGCTCGCC GACCTGGTCC GGCTCCGGCT GGCCGACCTC
GAGCCGGACG AGGTCGAGCT GGTCGCCATG GTGGCGCTCG CAGAGCCCCT CGAGGCCGAC
CTACCCGTCG TGGCGGAGCT CGCCGCCGCG GCCGAGTCGC TCAACCGGCG CCGGATCATC
GTGGCCGAAG GGGTCGGCCG CCGCGTCCAG CTGCGCCTGT TCTATCCGCT GCACAGTGAG
GTGCTGGTCG CCTCCCTGCC GGAGCTGACC GCCCGGCGGC TGCGCGCCCG GCTGGCCGCG
GCGATCGAAC GCACGGGGCT GCGCCGGCGC ACCGACCTGT TACGGGCCGT CCGGCTGCGT
CTTGACGCGG GTCAGATGCC CGCGCCGGCG CACCTGATCG ACGCGGCGGA ACGGGCGGTG
GGCACCGGCG ACGTCGTGCT CGCCGAACGG CTGTGCCGGC TCGCGATCGC GGTGCGGGAG
CCGACGCTCT CCCGCAGCCG CATCGAGCTG CTGCTGGGCC GGGCCCTGTG CTCGCAGGGC
AGGCATTCGG ACGCCGAGGA CGTCTTCGGG CGGGGCTGTG ACCACGCCCC TCGGGAGGAG
CTGGCCGTCC TGGCCCGCAC CCGGGCCCTG AACATGGCCG GTGGGCTGGG GCGGATCGAC
GACGCCGAGG CGTTGCTCGC CGCCGTCCGC TCCGCGGTGC CCGCGGCGGA CGCGGCGAAG
CTGTCGGCCA CCCAGGCGGT CGTCTGGATG CTCGCGGATC GGCTGCCGGA GGCGCTGACG
CTCGTGGGGT CCGCCCTCGC GGGGGAGAGC GCGGACTCAC CGCTGAGGCG CGAGGCCGTC
CCGGTCATGG CGGTCGCGCG CACCGAGCTC GGCGACGCGG CCGGTGCCCT CGAGCTGCTG
GACAGCTGCC TGCCGGCACT CGACCGGTGG GCTGATCACC ACTGGCTGCC GCACCGGCTG
GCGACGGTGG CCGCCAGCGT CGCGCTCGGA CGGATGGGCG ACGCTTCGGC GACGCTGCGG
CGGGTGCGCC GCCGGCTGGC CGACGGCCTG TCCCGGGTGA TGTGGGATCC GCTGACCGTG
CTCGTCGAGG CCCACCACCT GCGGCTGACC GGCCAGAGCG CCGAGGCTCT CGACCTGCTG
CGCCGCTCCG ACGACTCCGA CGCCGCCGCG AGCATCCCGG ACATCCGTTG CTGGACGCGT
GCCCAGGTGG CGGGTGCCCT CGCGGAGTCC GGCTCGCACG CGGACGCGCT GATGGCGATC
GCCGAGGCCC GCGCGCTGGT GGCCGCGACC GGCGGCTCCG CCACCGCCCG GGGGTGGGTG
ACGATCGAGG AGGTGACCGT GCACGCGCAT GCCGGTGACC GGGCGCGGGC CGTCTCTCTC
GCGCTGGAAC TGGCCGACCA CTTCGTGGCC GGCGGGCGGA TCGTCCGCGC CGTCGAGGCG
CTGCACCTCG CCGCCCGGCT GGGCGTCGCG AACGCCGTAG TGGCCCGATG CGAGGCGCTG
GCGGCCCGGA TCGGGACGGC CGACGTCGCG CAGGTTCGGT CCGCTCATGT CCGCGCCCTG
GCCGGCGCCG ACGGTGACGC GCTGAGCGGC GTCTCCCGCC GCTTCGAGGA CATGTGCCTG
CTGCCGCTGG CCGCGGAGAC CGCCGTCCAG GCGGCCGCCG CGTATCAGAT GCGCGGTGCG
GTCCGGAGAG GGCGGCTGGC TAGAGTCCGC GGCGCCGATC TTGTCTCCCG GTACGGGGGC
CGGCTGCCGC CCTGGGCGTC GGACGAGGTC AGCGTCGCCG CCGGTGCCGC GGCGACGCGG
GGCGGGGTGA GAGCGCACGG GGTTCCCGTC CCGGAGCTCA CCCCGAGGGA ACGCGAGGTC
GCGGCGTTCG CGGCCGTTGG GCTGTCGAAC CGGGAGATCG CGTCCCGCCT GGTGGTGTCC
GTGCGGACCG TCGAGAACCA CCTGCAACGC GCGTACGGCA AGCTCGGAGT GCTGCGTCGC
GCCGATCTGG CGCTCCGGCT GCAGGAGAGC CTGGAATCGG GCTTCGCGTC GGGGTCGGCG
ACCTGA
 
Protein sequence
MRRENTGGQE LDFLNRVGSL ALDTRESDGG AGLAAPDEDR AVSNQQWPMA ARASELAAVV 
GAVRRGTSVV LVGEPGVGKT RLLRTALAEV ANAGDPIRLV DAPHGTHPDL PGSLLAELSA
AFDLLGRDPL ECGPATVIRP RDGQHPAGAA DVCPQRMFGS AGLGATRPDH GLSGVRRGPE
AGGGNSAGAR RAPAAGAEGR VVLGVDDAHL LDSEPAAMLH HLVATGRVTL VAAVRSGEDQ
HRGVSRLWME RLAERFDLAE FDESGVHAMV RARLGGSVDE STLARLHHLT RGNALYLCEL
VGHALAEGTF VQADGIWRWS GLASSGGRLA DLVRLRLADL EPDEVELVAM VALAEPLEAD
LPVVAELAAA AESLNRRRII VAEGVGRRVQ LRLFYPLHSE VLVASLPELT ARRLRARLAA
AIERTGLRRR TDLLRAVRLR LDAGQMPAPA HLIDAAERAV GTGDVVLAER LCRLAIAVRE
PTLSRSRIEL LLGRALCSQG RHSDAEDVFG RGCDHAPREE LAVLARTRAL NMAGGLGRID
DAEALLAAVR SAVPAADAAK LSATQAVVWM LADRLPEALT LVGSALAGES ADSPLRREAV
PVMAVARTEL GDAAGALELL DSCLPALDRW ADHHWLPHRL ATVAASVALG RMGDASATLR
RVRRRLADGL SRVMWDPLTV LVEAHHLRLT GQSAEALDLL RRSDDSDAAA SIPDIRCWTR
AQVAGALAES GSHADALMAI AEARALVAAT GGSATARGWV TIEEVTVHAH AGDRARAVSL
ALELADHFVA GGRIVRAVEA LHLAARLGVA NAVVARCEAL AARIGTADVA QVRSAHVRAL
AGADGDALSG VSRRFEDMCL LPLAAETAVQ AAAAYQMRGA VRRGRLARVR GADLVSRYGG
RLPPWASDEV SVAAGAAATR GGVRAHGVPV PELTPREREV AAFAAVGLSN REIASRLVVS
VRTVENHLQR AYGKLGVLRR ADLALRLQES LESGFASGSA T