Gene Franean1_0399 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_0399 
Symbol 
ID5668823 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp475659 
End bp478193 
Gene Length2535 bp 
Protein Length844 aa 
Translation table11 
GC content60% 
IMG OID641239332 
Producthypothetical protein 
Protein accessionYP_001504771 
Protein GI158312263 
COG category[T] Signal transduction mechanisms 
COG ID[COG5635] Predicted NTPase (NACHT family) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCCCCATC TGGCATCACC CAATGGCGAC CTGACACCTG GCATGCGTGA AGCACTAAGC 
CGCATGCGCG AAATAAGAAG AAGGTTCGGC CCGGCATATG CGGCGATGGA GAAAAGGTCG
GGCGTAAGTC ATTCCAACTG GCACCGCTGG TTGCACGGAA AAGGGGTCAT GCCTTTGGAG
AGGGTGAGCG AGGCGAAAGG CATGCTGATG GGTTGCCTAG AAAGGGAGAT GAAGAAGCAT
TCCGGTGACG AGGATGTGCT ACGGACGCTA CGCGAGCTGA AAGAGTCGTT GGAGGAGGTG
ACAGCGCTGT GGATCCGAGA CCAAGGATCC ACTACCGCGC AAATGGTGGC CAATGAAATG
GTGACTAATG AGCAGCCCGA GCCCGATGTC CAGATCGAGA ACATATTGCC GGAACATGGT
CAGATGTTTT CGAGCGACTG GATGACTGAC ATGGCCGAGG AGCTCGCGGC GGCAGTCGAG
AGACAGTGGA TCTGGGAGGC CGACAGACGC GGATTGATCC GCCCTCCGGC GATCCCTGTA
CGATGGCGCT GGGCGCAGGG CATGACCAGC AAACTTGACG TGGTCCTCAA TAACTCACAC
CGGCGCACCC TGTTCCCTCC CCTACCGGGA GTGGCGTCGG CGACATCTGC GAGCTTGCAG
TCCGGCGGCC TCCAGCAGCT TTTCGATGCA TATGCCGGGG TGGACTCCGG CCGCCTTGTC
ATCATGGGCA AGTATGGCAG CGGAAAGTCT GCAACCGCCA TCTTGATGCT GCTCGACGCG
TTGTCACACC GTGACAGCCT CGACGACGTG CAACGGGCGA AGGTACCCGT GCCGATATTT
CTGACCACAC ACGACTGGGA CCCCCGACAC CAGAATCTTC CCGAATGGCT GATCACTCGA
CTTGCTAAGG AATACCAATT TCTCCGATCA ACTCAGGACG GCCGGAGTGC TGCCGCAAGA
TTGGTCGAAG ATGGTCGGGT TGCACTGTTC CTTGATGGCT TTGACGAAAT TCCATCCGAG
CTTCATCGGG ACGCGTTGGA GGAGATCCGC CGACTGGCGA CGTTCCGGCT GGTCACACTG
ACCCGCGACG GGGAGTTTGC CGCAGCCGTG CGTACGGGAC ACCATGTGGA CGCCGCCGCG
GTAGTGGAAT TGTTGCCGAT CCCGGCCGAT GAGATCGCTT CTTACCTTGA GACCCGTCAG
ACCGATCCCA TGCCCCCGCA GTGGGAGAAG CTTGTCACCT TTCTGCGAGA AAATCCGGAC
CATATCCTGG CGCAAGCGCT AGACTCGCCG CTCGTCTTGA CGCAACTGCG AGACGCAATC
CAAGATCCGG CTGACATCGA CGATCTCCTC GTCGCAGACA GGTTCGAGAG CCGAGAAGCG
GTCGCAGAAT ATCTGATGGA CCGGTCCATC GATGTCGCCT ATCGGCGATT CTCCCGCGAT
ACGTCCACCG TCGGTCCCGA CAAGGCGAGA GCTGCTCTAG GATATATAGC CGCGAAGATG
AACGAGGAAA ACACCCGCGA CCTGGCCTGG TGGCAAATTC ATCACTGGGC CTCGCCTATT
CCTCGGATCC TTGCGACGAC AGTTCTAGGC GTGCTGATAG GGGCACCTGT AGGCGCGTTG
ATGTTCGGTC CGCTCGGCCA GTATGCGGTG AGAGGGCATA CTGGAACGTT GTTTGGGGCC
CAGTATCTTT CCATGATGTG CCTTGTTTTT GGGCTGATGG CTGGACTCGT TTCGGAGGCC
CGCGGAGGCC GCTCCCGTCG AACAGGCCGA TTCAGATGGG TCGACTGGTA TCGCGGTCAG
ACTAACAGCG CGGTCGGTCT GCTGTTTACT GTTGCCGTCA CGATGGCCGT TGGTAACCAG
TCCAATTACG CCTTCGGGGC ATTGGCGGGG GTTCTAGCAG GGATCGTGGC CGGGTATGCT
GCGAGGGGCG ACCACCAAGA TCACAGGTGG ATACAGCGGT CTTGGTGGAT CACGCTTCGA
TCAAGGCTCG ACCCGGTCGC AGGGGCTGTA GCGGGATTGC CGATCGGACT GACGTATGGA
TTGACCAAAG AACATACTCA GGGCCTCGTG GCCGGTATCA TGAGCGCAAT CGCCTTCGGC
CTCATGGTCG GCTTCGCGCG ACCGACGGCT GGTATTCAGG CTGTTACCGA TCCACGAACA
TCCTGGCTCC GAAATCACGA ACATGCAGCC ACCTTCAGCC TGGCCGCTGG CCTAGCACTC
GGGCTTCCGC TTGGATTGAA AAACGGGCTG GAGCACGGCG TCATCGCCGG CGCCGTCGCC
GGCGTTTGCG TTGGGCTCAT CGTCGGACTT GGGTGTTTGA TCGGGGCGTC CGACAGGTTG
CGGACCACCC TGCTGTTTCT CCAGCTGCGC GGCCACGGCA TTCCTTTGGA CGGAATGCGT
TTCCTGGAGG ATGCGCGCCG GAAGAATCTT CTCCGTACCG TCGGACCGCT ATACCAGTTC
CGGCATCCCA GCATTCAAGA CCGACTCGCG AGGACATACG GGCAGCAGCA AACCCGCGTC
GATCCCGAAA TCTGA
 
Protein sequence
MPHLASPNGD LTPGMREALS RMREIRRRFG PAYAAMEKRS GVSHSNWHRW LHGKGVMPLE 
RVSEAKGMLM GCLEREMKKH SGDEDVLRTL RELKESLEEV TALWIRDQGS TTAQMVANEM
VTNEQPEPDV QIENILPEHG QMFSSDWMTD MAEELAAAVE RQWIWEADRR GLIRPPAIPV
RWRWAQGMTS KLDVVLNNSH RRTLFPPLPG VASATSASLQ SGGLQQLFDA YAGVDSGRLV
IMGKYGSGKS ATAILMLLDA LSHRDSLDDV QRAKVPVPIF LTTHDWDPRH QNLPEWLITR
LAKEYQFLRS TQDGRSAAAR LVEDGRVALF LDGFDEIPSE LHRDALEEIR RLATFRLVTL
TRDGEFAAAV RTGHHVDAAA VVELLPIPAD EIASYLETRQ TDPMPPQWEK LVTFLRENPD
HILAQALDSP LVLTQLRDAI QDPADIDDLL VADRFESREA VAEYLMDRSI DVAYRRFSRD
TSTVGPDKAR AALGYIAAKM NEENTRDLAW WQIHHWASPI PRILATTVLG VLIGAPVGAL
MFGPLGQYAV RGHTGTLFGA QYLSMMCLVF GLMAGLVSEA RGGRSRRTGR FRWVDWYRGQ
TNSAVGLLFT VAVTMAVGNQ SNYAFGALAG VLAGIVAGYA ARGDHQDHRW IQRSWWITLR
SRLDPVAGAV AGLPIGLTYG LTKEHTQGLV AGIMSAIAFG LMVGFARPTA GIQAVTDPRT
SWLRNHEHAA TFSLAAGLAL GLPLGLKNGL EHGVIAGAVA GVCVGLIVGL GCLIGASDRL
RTTLLFLQLR GHGIPLDGMR FLEDARRKNL LRTVGPLYQF RHPSIQDRLA RTYGQQQTRV
DPEI