Gene Franean1_3701 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3701 
Symbol 
ID5672067 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp4382428 
End bp4383645 
Gene Length1218 bp 
Protein Length405 aa 
Translation table11 
GC content71% 
IMG OID641242584 
Producthypothetical protein 
Protein accessionYP_001508004 
Protein GI158315496 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGAGA AGCCCACCGA GTCGAAGATC GAGCTCGACC TCTCCGACGT CGACCACCGG 
GTCGGCCTGC CGATCGGCGG GGGGCAGCTG TGGGACCCGT GCACCGCGAC GGACATCCGC
CGCTGGGTGA TGGCGATGGA CTACCCCAAC CCGCTGCACT GGGACGAGGA GTTCGCGCGT
GAGTCCCGGT ACGGCGGCCT GATCGCCCCG CAGTCGATCG CGGTCGGCCT GGACTACGGC
CACGGCTGCG CGCCCGCCTG CGTCGGACGC ATCCCCGGCA GCCACCTGAT CTTCGGCGGC
GAGGAGTGGT GGTTCTACGG CAGCCCCATC CGGGTCGGCG ACAAGTTGGT GCAGGAGCGG
CGTTTCCACG ACTACAAGGT CGCGGAGACG AAGTTCGCCG GGCCGACCAT GTTCTCCCGC
GGGGACACCG CCCACCGCAA CCAGCACGGC GCCCTGGTGG CCCGCGAGCG CTCCACCGCC
ATCCGCTACC TCGCCGCCGA GGCGGAGAAG CGCGGCATGT ACGAGAACCA GGTCGGCGCG
GTGAAGCGCT GGACGAGCGC CGAGCTGGCC GAGATCGAGA AGCTCCGCGA CAGCTGGCTC
CACTCGAACC GCACCGGCCT CTCGCCCGCC TTCGAGGACG TCAGCGTCGG TGACACCCTG
CCGCGGCGGG TGATCGGCCC GCACAGCATC GCCAGCTTCA CCACCGAGTA CCGCGCCTTC
ATCTTCAACA TCTGGGGGAC GTTCCACTGG ACGGCGCCCC CCGGCATCGA GGACCCCTGG
GTCTACCAGG ACCCGGGCTG GGTGGAGGGC TTCGGCTTCG ACGAGGAGGG CGCCCGGATC
GACCCGCGCC TGCGCGACGG GCTCTACGTC GGGCCGTCGC GCGGTCACAT CGACAGTGAC
AAGGCCAGCG AGGTCGGCAT GGCCCGCGCC TACGGCTACG GCGCCACGAT GGGCGCCTGG
TGCACCGACT ACCTCTCCTA CTGGGCCGGC CACGACGGCA TGGTGCGGCA CTCCAAGGCC
AGTTTCCGCC TCCCCGCCTT CGAGGGCGAC GTCACCTACT TCGACGGCGA GGTGGTCGGC
AAGGAGGAGG GCTCGGTGTG GGGCGTGCCG CTGGTCCAGG TGAAGCTGCG GCTCACCAAC
CAGGACGGCG GCGTGCTGGT GGACTGCACC GCCGAGGTCG AGCTGCCGTA CCGGCGCGAC
CGCGTGGCCG GGAGCTGA
 
Protein sequence
MSEKPTESKI ELDLSDVDHR VGLPIGGGQL WDPCTATDIR RWVMAMDYPN PLHWDEEFAR 
ESRYGGLIAP QSIAVGLDYG HGCAPACVGR IPGSHLIFGG EEWWFYGSPI RVGDKLVQER
RFHDYKVAET KFAGPTMFSR GDTAHRNQHG ALVARERSTA IRYLAAEAEK RGMYENQVGA
VKRWTSAELA EIEKLRDSWL HSNRTGLSPA FEDVSVGDTL PRRVIGPHSI ASFTTEYRAF
IFNIWGTFHW TAPPGIEDPW VYQDPGWVEG FGFDEEGARI DPRLRDGLYV GPSRGHIDSD
KASEVGMARA YGYGATMGAW CTDYLSYWAG HDGMVRHSKA SFRLPAFEGD VTYFDGEVVG
KEEGSVWGVP LVQVKLRLTN QDGGVLVDCT AEVELPYRRD RVAGS