Gene Franean1_0494 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_0494 
Symbol 
ID5668913 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp574870 
End bp576030 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content65% 
IMG OID641239423 
ProductWD40 domain-containing protein 
Protein accessionYP_001504861 
Protein GI158312353 
COG category[R] General function prediction only 
COG ID[COG3211] Predicted phosphatase 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAACGGC GAACGTTCTT GCGTTCCGCT ATCGCGGGAA CCGGCGTGGT CGCCTTCTCC 
GGGGCGATAT GGGACACGGC CCTGGCCGCT CCCGCCCAGA ACGGCTCCAG CCCCTATGGA
TCGCTGTTGG CGGCTGACGC CAACGGGGTC ATGCTCCCCT CCGGTTTCAC TAGCCGCATT
GTCGCCCGTT CTGGTCAGGT GGTTTCCGGA ACCAGCTACA CCTGGCACAA CGCCCCGGAC
GGCGGCGCGG TTTTCCTGAA CGGCACGGGT TGGATGTACG TCTCCAATTC GGAGGTCGGC
AGCAGCGCGG GCGGGGCTTC GGTGCTGCGC TTCGACTCCA GCGGAACCGT CACCTCCGCC
CAGCGCATTC TTTCAAACAC GAGCAGCAAC TGTGCGGGTG GGGCGACTCC GTGGGGCACG
TGGCTGTCCT GCGAGGAGAC CTCCAACGGG CGGGTGTGGG AGACCTATCC GGCCACCGGC
GCGTCGGCTG TCAGCCGGCC GGCCATGGGC CGTTTCAAGC ACGAGGCGGC TGCGTGCGAC
CCGGTCCGCC AGGTCATCTA CCTGACCGAG GACCAGACTG ACGGCTGCTT CTACCGGTTC
CGTCCGACCA CCTGGGGAAA CCTCTCCTCC GGCACGCTCG AGGTGTTGGT TGCGGGATCT
GGAACATCCG GTACGGCAAC CTGGCAGGTC GTCCCGGACC CGGACGGTTC ACCGACCGCC
ACCCGCAGCC AGGTTTCCGG AGCGAAGCAC TTCAATGGCG GCGAGGGCTG CCACTACGCG
AACAACACCG TCTGGTTCAC CACCAAGGGC GACAACCGGG TGTGGGAGGT TCACGTCGAC
ACCAACACAT TCGAACTTGC ATATGACGAC TCGCTGGTAT CTCCTGGACC TGCCCCGCTG
ACCGGTGTCG ACAACATCAC CGGATCGACG TACGGGGACC TTTACGTTGC TGAGGATGGC
GGGAACCTGG AGATCTGCAT CATCACTCCG GACGACATCG TGGCCCCGAT CCTGCGGCTG
GTCGGGCACA ACTCATCCGA AATAACCGGA CCAGCATTCT CCCCGAACGG CCAGCGGCTG
TACTTCTCGT CCCAGCGCGG CACGTCCGGC TCGTCCTCCG GCGGCATCAC CTTCGAGGTC
ACCGGTCCGT TCCGGACCTG A
 
Protein sequence
MERRTFLRSA IAGTGVVAFS GAIWDTALAA PAQNGSSPYG SLLAADANGV MLPSGFTSRI 
VARSGQVVSG TSYTWHNAPD GGAVFLNGTG WMYVSNSEVG SSAGGASVLR FDSSGTVTSA
QRILSNTSSN CAGGATPWGT WLSCEETSNG RVWETYPATG ASAVSRPAMG RFKHEAAACD
PVRQVIYLTE DQTDGCFYRF RPTTWGNLSS GTLEVLVAGS GTSGTATWQV VPDPDGSPTA
TRSQVSGAKH FNGGEGCHYA NNTVWFTTKG DNRVWEVHVD TNTFELAYDD SLVSPGPAPL
TGVDNITGST YGDLYVAEDG GNLEICIITP DDIVAPILRL VGHNSSEITG PAFSPNGQRL
YFSSQRGTSG SSSGGITFEV TGPFRT