Gene Franean1_0147 details

Gene Information

Locus tag: Franean1_0147
Symbol:
ID: 5668572
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 176507
End bp: 177967
Gene Length: 1461 bp
Protein Length: 486 aa
Translation table: 11
GC content: 76%
IMG OID: 641239076
Product: ExsB family protein
Protein accession: YP_001504520
Protein GI: 158312012
COG category: [R] General function prediction only
COG ID: [COG0546] Predicted phosphatases
        [COG1606] ATP-utilizing enzymes of the PP-loop superfamily
TIGRFAM ID: [TIGR00268] conserved hypothetical protein TIGR00268
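
The coordinate and length fields above appear to be mutually consistent. The following is a minimal sketch of that check, assuming the coordinates are 1-based and inclusive and that the gene length includes the terminal stop codon; neither convention is stated on this page, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon except the final stop codon encodes one residue.
    return gene_length_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(176507, 177967)
print(length_bp)                  # 1461
print(protein_length(length_bp))  # 486
```

Under these assumptions the computed values match the Gene Length (1461 bp) and Protein Length (486 aa) fields.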


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00645126
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
GTGTCCCCCT GGAGAACATC CACCCTGGTC GTCGGTTTCG ACCTGGACAT GACCCTGCTG 
GACGCCCGCC GCGGGATCGT CGCCACCTTC GCCGAGCTGG CCGCCGAGAC CGGCGTCACG
ATCGACGGCG AGGCCGCCGT GCGCCGTCTC GGCCCCCCGC TCGAGGACGA GATCGCCCGC
TGGTTCCCGG CGGACGAGGT GGCCCGGCGG GCCGCCCGCT ATCGCGAGCT CTACGCGGTG
CACGCGGTGC CCGTCTCCGT CGCGATGCCG CACGCCGCCG AGGCCGTCGA GGCGGTCCGC
AGCGCCGGTG GGCGGGTCGT GGTCGTGACC GCGAAGAGCG AGCCGCTGGC CCGGGCCAGC
CTGGAGCACA TCGGAATCAC AGCGGACCAC GTCGCGGGCT GGCTGTTCGC CGAGACCAAG
GGCGCGGCGA TCGAGGAGCA CGGCGTGGAC GTCTTCGTCG GCGACCACGT CGGGGACGTC
CACGGCGCGC GCGCGGGCGG AGCGGCCTCC GTCGCGGTGC CCACCGGCCC CTGCCCGCCG
GACGACCTGA CCGCGGCCGG GGCGGACGTC GTCCTGCCCG ACCTGGCGGC CTTCCCGGCC
TGGCTGGCGG ACGAGGTGCT CCGCCGGCGC CTGGACACGC TCAGCGCACG GCTGGGTGAG
CTCGGATCGG TGCTGGTCGC CTTCTCGGGC GGCGCGGACT CCGCCTTCCT GCTGGCCGCC
GCCGCGCGGG AACTCGGCCC CGATGCCGTG GTCGCGGCCA CGGCCGTGTC GGCGTCCCTG
CCCGCGGCCG AACTCGACGC GGCCCGCCGC TTCGCCACTG GCCTCGGGGT GCGCCACCTG
TTCCCGGCGA CCGACGAGAT GAGCCACGAG GGCTACCGCG CCAACAGCTC CAACCGTTGC
TACTTCTGCA AGTCCGAGCT CGTCGACACA CTCGCGCCGC TCGCCGCCGA GCTGGGCCTG
GCGCACGTGG TCACCGGCAC CAACGCCGAC GACGCCCGTG CGGGGTTCCG GCCCGGCATC
GGCGCGGCGG CCAGCCGGGG CGCGCGCACG CCGCTGCTGG ACGCCGGTCT GACCAAGGCC
CAGGTGCGCG CCGCCTCCCG CACCTGGGGG CTGCCGACCT GGGACAAGCC GGCGGCGGCC
TGCCTGGCGA GCCGGATCGC CTACGGGGTG CGGGTCAGCC CGGCCCGGCT CGCCCGGGTG
GAGCGTGCCG AGACGGCGCT GCGGGTGGCC ACGGCAGCGG CCGGCCTGCA CATCCGCGAC
CTGCGGGTGC GCGATCTCGG GGACGTCGCC CGGATCGAGG TGGATGCCGA CCACGTGGCC
GGACTGGTGG CCCGTCCTGA TCTGGTCTCG GTGGTCGTCG AGTCCGGTTT CGCCCGCGCG
GAGGTCGATC CCCGGGGCTT CCGGTCCGGC TCGATGAACG AGCTCCTTCC CGCCCCCGGG
CGCCAGACCG AGCCGGCCTG A
 
Protein sequence
MSPWRTSTLV VGFDLDMTLL DARRGIVATF AELAAETGVT IDGEAAVRRL GPPLEDEIAR 
WFPADEVARR AARYRELYAV HAVPVSVAMP HAAEAVEAVR SAGGRVVVVT AKSEPLARAS
LEHIGITADH VAGWLFAETK GAAIEEHGVD VFVGDHVGDV HGARAGGAAS VAVPTGPCPP
DDLTAAGADV VLPDLAAFPA WLADEVLRRR LDTLSARLGE LGSVLVAFSG GADSAFLLAA
AARELGPDAV VAATAVSASL PAAELDAARR FATGLGVRHL FPATDEMSHE GYRANSSNRC
YFCKSELVDT LAPLAAELGL AHVVTGTNAD DARAGFRPGI GAAASRGART PLLDAGLTKA
QVRAASRTWG LPTWDKPAAA CLASRIAYGV RVSPARLARV ERAETALRVA TAAAGLHIRD
LRVRDLGDVA RIEVDADHVA GLVARPDLVS VVVESGFARA EVDPRGFRSG SMNELLPAPG
RQTEPA
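
The gene and protein sequences above can be related by a short translation sketch. This is an illustration in plain Python, not the tool used to produce this record. It assumes the standard codon assignments shared by NCBI translation tables 1 and 11, and it treats an initial alternative start codon (such as the GTG at the start of this gene) as methionine. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Start codons listed for translation table 11 (bacterial, archaeal, plant plastid).
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the 10-base display blocks."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon.

    Raises KeyError on codons containing characters other than A, C, G, T.
    """
    dna = clean(seq)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an initiating codon is read as methionine
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases and final 21 bases of the gene sequence above.
print(translate("GTGTCCCCCT GGAGAACATC CACCCTGGTC"))  # MSPWRTSTLV
print(translate("CGCCAGACCG AGCCGGCCTG A"))           # RQTEPA
```

The two fragments translate to the first ten and last six residues of the protein sequence shown above. Passing the full gene sequence to `gc_fraction` would give a value to compare against the GC content field.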