Gene Franean1_0888 details

Gene Information

Locus tag: Franean1_0888
Symbol:
ID: 5669302
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 1035197
End bp: 1036396
Gene Length: 1200 bp
Protein Length: 399 aa
Translation table: 11
GC content: 71%
IMG OID: 641239815
Product: hypothetical protein
Protein accession: YP_001505250
Protein GI: 158312742
COG category: [D] Cell cycle control, cell division, chromosome partitioning; [R] General function prediction only
COG ID: [COG0489] ATPases involved in chromosome partitioning; [COG2151] Predicted metal-sulfur cluster biosynthetic enzyme
TIGRFAM ID:
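
The coordinates and lengths listed above are related by simple arithmetic, which the short sketch below checks. It assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The function names are illustrative and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1035197, 1036396)
print(length, protein_length(length))  # 1200 399
```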


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.100388
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.293855
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCACATC CGAGGTCGGC CACGACCGAG GATGTCGACC CGCGACCGAG GTTTCCCTTG 
CCACCAGCCC TGCCTTCGTC CGACGCCATC CAGTCCGCCC TCGCGACCGT GCTGGACCCG
GAGATCGGAC GGCCGATCAC CGAACTGGAC ATGGTGGATT CGGCCCACGT TCGCGACGAC
GGATCCGTCG ATGTGGTCGT CCTGTTGACC GTCTCCGGCT GCCCGATGCG GGATGAGATC
ACATCCCGGG TCACCCGCGC CGTCAATGGT GTGGACGGCG TCCGGGACGT ACGGGTGACC
CTCGAGGTGA TGACCGCCGA GCAGCGGACG GCCCTGCACG AGAAGCTGCG GGGCGGCACC
CCGCAGCGGG TCATCCCCTT CGCGCAGCCC GGGTCGATGA CCAGGGTCTA CGGGGTGGCC
AGCGGCAAGG GCGGCGTCGG CAAGTCGTCG GTCACCGTCA ACCTGGCGGC CGCGATGGCC
CGGTCGGGGC TCGCCGTCGG CGTCCTCGAC GCGGACATCT ACGGCCACTC CGTCCCCCGG
ATGCTGGGCA TCGACCGCGC ACCCACACAG GTCGAAAAGA TGATCATGCC ACCGCAGGCC
CACGGCGTGA AGGTCATCTC GACCGGCATG TTCACCCGCG GCAACCAGCC CGTCACCTGG
CGCGGCCCGA TGCTGCACCG AGCCCTCGAG CAGTTCCTCT CGGACGTCTT CTGGGGCGAC
CTCGACGTCC TGCTCCTCGA CCTGCCGCCC GGCACCGGGG ACATCGCGAT CTCCCTGGCC
CAGCTGGTTC CCTCGTCCGA GCTGCTACTG GTGACCACAC CCCAGCTCGC CGCGACCGAG
GTCGCCGAGC GCGCCGGGAC GATCGCCGTC CAGACCCACC AGAACGTCGT CGGCGTGGTC
GAGAACATGG CCTACATGCC GTGCCCGCAC TGCGGCGAAC GCGTCGACGT GTTCGGCGAG
GGCGGCGGCG CGGCCGTCGC CGAGCGGCTG ACCAGGGTGC TCGGCCACGA GGTGCCGCTG
CTCGCCCAGG TCCCGGTGGA CGTCCGCCTC CGGCAGGGCG GCGACTCCGG CAAGCCGCTG
GTCCTCTCCG ACCCCGACTC CGAGGCCGGG AAGGCGCTGC GCGCCGTCGC CGAGCGGCTG
ACCTTCCGCT CCCGCGGCCT GTCCGGCCGC TCGCTGGACA TCAGCCCCGC CCGCCGCTAG
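
A GC content figure can be recomputed from a sequence printed in this layout. The sketch below is one plausible way to do it, assuming GC content means the percentage of G and C bases over the full sequence, and that the spaces and line breaks between the 10-base blocks are formatting only. The helper name `gc_percent` is hypothetical.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace between blocks."""
    cleaned = "".join(sequence.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in cleaned if base in "GC")
    return 100.0 * gc_count / len(cleaned)


# Example call on the first two 10-base blocks of the gene sequence.
print(gc_percent("ATGCCACATC CGAGGTCGGC"))
```

Passing the complete gene sequence instead of the two-block excerpt gives the value to compare against the reported GC content.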
 
Protein sequence
MPHPRSATTE DVDPRPRFPL PPALPSSDAI QSALATVLDP EIGRPITELD MVDSAHVRDD 
GSVDVVVLLT VSGCPMRDEI TSRVTRAVNG VDGVRDVRVT LEVMTAEQRT ALHEKLRGGT
PQRVIPFAQP GSMTRVYGVA SGKGGVGKSS VTVNLAAAMA RSGLAVGVLD ADIYGHSVPR
MLGIDRAPTQ VEKMIMPPQA HGVKVISTGM FTRGNQPVTW RGPMLHRALE QFLSDVFWGD
LDVLLLDLPP GTGDIAISLA QLVPSSELLL VTTPQLAATE VAERAGTIAV QTHQNVVGVV
ENMAYMPCPH CGERVDVFGE GGGAAVAERL TRVLGHEVPL LAQVPVDVRL RQGGDSGKPL
VLSDPDSEAG KALRAVAERL TFRSRGLSGR SLDISPARR
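
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below assumes the amino acid assignments of NCBI translation table 11, which match the standard genetic code, and stops at the first stop codon. It does not handle the alternative start codons that table 11 allows, which is sufficient here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codons in NCBI order (T, C, A, G; first base varies slowest) with the
# amino acid assignments used by translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cleaned = "".join(sequence.split()).upper()
    residues = []
    for i in range(0, len(cleaned) - 2, 3):
        amino_acid = CODON_TABLE[cleaned[i:i + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three 10-base blocks of the gene sequence.
print(translate("ATGCCACATC CGAGGTCGGC CACGACCGAG"))  # MPHPRSATTE
```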