Gene Franean1_4095 details

Gene Information

Locus tag: Franean1_4095
Symbol:
ID: 5672453
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 4879616
End bp: 4881307
Gene Length: 1692 bp
Protein Length: 563 aa
Translation table: 11
GC content: 76%
IMG OID: 641242971
Product: hypothetical protein
Protein accession: YP_001508388
Protein GI: 158315880
COG category:
COG ID:
TIGRFAM ID:
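
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the last codon of the CDS is a stop codon. Neither is stated in the record, although both agree with the listed values. The function names are hypothetical helpers, not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based, inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by an unspliced CDS, assuming its final codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(4879616, 4881307)
print(length)                     # 1692
print(protein_length_aa(length))  # 563
```

Both results match the Gene Length and Protein Length fields of this record.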


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.219157
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGGCGC GGGACGGGCG GCAGCCGGCC AGCCGGCGGG GATTTCTCGA CGCGACGCTG 
CGGCTGGCGG CCTCCGGCGG GCTCGCCGCG GTCGCCGGCA CCGGGGCGGC GGGCTGCGAC
GGCCCGGCGC CCGATCCGGG CGGCGCACCG TCGGCCACCG CCGCGCCCCG GCCCTTCGCC
ACCGGCACGC CGAGCCCGAT CGCCGCCACC GACCTGGTGG ACGTCGACAC GCTCTGGGGC
TGGGTCGAGC GGATGACCGC GGTGGGACCG CGGTTCACCG GCTCGGCGGC GCACCGGCAC
CACATCGACG ACCTCGAGCG ACAGCTGCGC GACTACGGCC TCGAGGTGAC CCGCCATCCG
ACCCTGCTGT CGCAGTGGCT CGCGCGGTCC TGGTCGCTGC GGGTCACCGA CGCCACCGGC
GCGACCCGGT CGGTGCCCGT CGCCTACTAC CGGCCGTACT CGGGGGAGAC CGGCCCGGCC
GGTGTCGAAG GACCGCTGGT CGACCTGGGT GCCGGCGCCG AAGCCGACTA CCAGGCCGCC
GGCGCCGGCC TGGCGGGCGC CGCGAGCGGC CCGGATGCTC TGGGAGCCCC AGATGCCCTG
GGCGGCCCGG TCGCGGGGTC GCCGTCCGGG GCGATCGTGC TGGTCGACGC CCCGGTGGCC
CGGCTGCGTG CCTCGGTGCT CGCCGACCTG GCGTACGACG TCCACCCGCC CGAGGCCCGC
GCCGAGTTCG CGGCGGAGGA CTACTCCCGG GTCTGGCTGG GGGTCCCGCC GGCGCCGAGC
CTGGCCACCG CCCGCGCCCA CGGCGCCGTC GGCATGATCG AGGTCTCCGA CATGTCACCC
GCCCTCGCCG CCGGGCAGTA CACCCCGCAC CAGCAGGAGC ACGCCGACCT CCCGGCGCTG
CGGGTCGACC GGGAGCAGGG CGCCGTCCTG CGCGGCCTGC TCGCCCGCGG GCCGGTGCGC
GCCACCCTCG TCCTGGACGC CGACCGGAGC CGCACCACCG TCGACTACCT GCTGGCCCGC
CTGCCCGGAG CCGGGCCGCG CCAGGACCCA CGGGCCGTCC TCGTCGCCAC CCACACCGAC
GGGCAGAACG CGCTCGAGGA GAACGGCGGC CCGGCGCTGC TCGCGCTCGC CGAGTACTTC
AGCCGGTTCC CGGCCGCCAC CCGCCGCCGC GACCTGCTGT TCATGTTCTC TCCGAGCCAC
ATGACCGCCG AGACGGCGAC CGTGAAGCCG GACGAGTGGC TGCGCGAGCA TCCCGACATC
ACGGCGGGGA TCGACATGGC GCTCGTCGCC GAGCACCTCG GCGCCATGGC CTGGGACGAC
GACGGCGGCA CCGGCCCCTA CCGCGCCACC GGCCGCACCG AGCCGGTCGC CGTCGCGGTG
GGCAACAGCG AGACCCTGCG CCGGCTGGCC ACCGACGAGG TCCGCCACAG CGACCTGGCC
CGCACCGGCA TCCAGAAGCC GTTCCAGGAC GGCCTCTACG GCGAGGGGAC CTTCGCCTAC
CGCCTCGGCA TCCCCACCAT CGCGATGATC ACCGGGCCGG CCTACCTGCT CCAGGTCACC
GAGGGCGACA ACCTCGACAA GCTGGACCGG GACCTACTGC ATCGCCAGAC GCTCTTCCTG
GCCCGGCTGC TCGCCCGGAT GACCGAGCTG CCCATCACTT GGCCGCCTCG AACCGCCGAG
CCGCCGAGCT GA
 
Protein sequence
MTARDGRQPA SRRGFLDATL RLAASGGLAA VAGTGAAGCD GPAPDPGGAP SATAAPRPFA 
TGTPSPIAAT DLVDVDTLWG WVERMTAVGP RFTGSAAHRH HIDDLERQLR DYGLEVTRHP
TLLSQWLARS WSLRVTDATG ATRSVPVAYY RPYSGETGPA GVEGPLVDLG AGAEADYQAA
GAGLAGAASG PDALGAPDAL GGPVAGSPSG AIVLVDAPVA RLRASVLADL AYDVHPPEAR
AEFAAEDYSR VWLGVPPAPS LATARAHGAV GMIEVSDMSP ALAAGQYTPH QQEHADLPAL
RVDREQGAVL RGLLARGPVR ATLVLDADRS RTTVDYLLAR LPGAGPRQDP RAVLVATHTD
GQNALEENGG PALLALAEYF SRFPAATRRR DLLFMFSPSH MTAETATVKP DEWLREHPDI
TAGIDMALVA EHLGAMAWDD DGGTGPYRAT GRTEPVAVAV GNSETLRRLA TDEVRHSDLA
RTGIQKPFQD GLYGEGTFAY RLGIPTIAMI TGPAYLLQVT EGDNLDKLDR DLLHRQTLFL
ARLLARMTEL PITWPPRTAE PPS
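
The gene and protein sequences are printed in blocks of ten characters, so the spaces have to be removed before any processing. The following is a minimal sketch of how the protein sequence could be re-derived from the gene sequence and how a GC fraction could be computed. It assumes the sequence is the coding strand read in frame from its first base. Translation table 11 assigns the same amino acids to codons as the standard code and differs in which start codons it allows, which does not matter here because this gene begins with ATG. All names (`CODON_TABLE`, `clean`, `translate`, `gc_fraction`) are hypothetical and chosen for illustration.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the record."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last lines of the gene sequence above.
first_line = "ATGACGGCGC GGGACGGGCG GCAGCCGGCC AGCCGGCGGG GATTTCTCGA CGCGACGCTG"
last_line = "CCGCCGAGCT GA"
print(translate(first_line))  # MTARDGRQPASRRGFLDATL
print(translate(last_line))   # PPS
```

The two outputs match the first twenty and last three residues of the protein sequence. Passing the whole gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.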