Gene Franean1_4822 details


Gene Information

Locus tag: Franean1_4822
Symbol:
ID: 5673163
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 5757892
End bp: 5759169
Gene Length: 1278 bp
Protein Length: 425 aa
Translation table: 11
GC content: 72%
IMG OID: 641243678
Product: transposase IS4 family protein
Protein accession: YP_001509094
Protein GI: 158316586
COG category:
COG ID:
TIGRFAM ID:
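
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the function names are hypothetical and not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose start and end are both inclusive."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(length_bp: int) -> int:
    """Codon count minus one, assuming the final codon is a stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length = gene_length_bp(5757892, 5759169)
print(length)                              # 1278
print(expected_protein_length_aa(length))  # 425
```

Both values match the Gene Length (1278 bp) and Protein Length (425 aa) fields in the record.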


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.154643
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGCGGCGT GGCCTCGGTG CCGCTGGGTG GTTTGGTCAC GGTTCGTGGC TTCGATGGTG 
ACCCCGGACC AGGTGTCGGT CGGGGTGCTG GTGACGGCGG TGCCGCGTGA CGCGGTCGAC
GAGGCCGTCG CGGCCTGTGG GGTGGGTGCG CGGCGGGCGG GCGGGAAGCT CCCACCGCAT
GTGACGGCGT ACCTGACGTT GGCGATGTCC CTGTTTCCGG ACGACGACTA CGCCGAGGTC
GCCCAGAAGG TGACCGGGTC GCTGGACCGG TTCGGCTGCT GGGACGCGGC GTGGGCGCCG
CCGAGCGCGA GCGGGATCAC CCAGGCGCGT AAGCGGCTGG GCCGGATGGT GATGGCCGAG
GTGTTCGAGC GGGTCGCGGG CCAGGTCGCG ACACTGTCGA CGCGTGGCGC GTGGCTGCGG
GGCCGGTTGT TGCTCGCGAT CGACGGGTTT GACGTCGACG TGCCCGACAC CGAGGAGAAC
GCGGCCGAGT TCGGCTACGC CGGCACCGGG GAGAAGCGGT CGGCGTTCCC GAAGATCCGG
GTCGTCGCGT TGGCGGAGTG CGGGACGCAC GCGTTCCGGG CCGCCGAGGT CGGTGGCTGG
GCGGCTGGGG AGAGGACGCT GGCCCGCGGG CTGCTGATGC GGCTGAACCG CGACGAGGTG
CTGACCGCCG ACCGTGGGTT CTACTCGTTC GACAACTGGG CGCTGGCCGC GGGCACCGGC
GCCGACCTGA TCTGGCGGGC CCCGACCGGG CTGAACCTGC CGGTCGTGCG GGTCCTGTCC
GATGGCACGT TCCTCACCGT CCTGATCAAC CCGGAGATCA CGGGAGGTCG GCGCCGCGAG
CGGCTGCTCG CCGCCGCGAA GGCCGGCGAC GAGCTTGATC CGGACGAGGC GCACCTGGCC
CGGGTCGTCG AGTACGACAT CCCCGACCGG GCCGGTAACG GTACCGGCGA ACTGGTCGTC
GTGCTGACCA CGATCCTCGA CCCGCGTCAG GCCCGTGCCG ACGAGGTCGC CGCCGGATAC
AACGAGCGCT GGGAGGAGGA AACCGCGAAC GACCAGCTCA AGACCCATCT ACGCGGCCCC
GGGAGAGTCC TGCGCTCCCG GCTGCCGGAC CTGGCGGTCC AGGAGATGTG GGCCTGGCTG
ATCGTCCAGT ACGCGCTCAC CGCCCTGATC GCCGGCGCCG CGGAGGCCGC CGAGATCGAC
CCCGACCGGG TCGGTTTCGC CCGGACACTG CGCCTGGTCC GCCGTTCCGC CACCGGAACG
GCGGACATTT CCCCCTGA
 
Protein sequence
MAAWPRCRWV VWSRFVASMV TPDQVSVGVL VTAVPRDAVD EAVAACGVGA RRAGGKLPPH 
VTAYLTLAMS LFPDDDYAEV AQKVTGSLDR FGCWDAAWAP PSASGITQAR KRLGRMVMAE
VFERVAGQVA TLSTRGAWLR GRLLLAIDGF DVDVPDTEEN AAEFGYAGTG EKRSAFPKIR
VVALAECGTH AFRAAEVGGW AAGERTLARG LLMRLNRDEV LTADRGFYSF DNWALAAGTG
ADLIWRAPTG LNLPVVRVLS DGTFLTVLIN PEITGGRRRE RLLAAAKAGD ELDPDEAHLA
RVVEYDIPDR AGNGTGELVV VLTTILDPRQ ARADEVAAGY NERWEEETAN DQLKTHLRGP
GRVLRSRLPD LAVQEMWAWL IVQYALTALI AGAAEAAEID PDRVGFARTL RLVRRSATGT
ADISP
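
The relationship between the gene sequence, the protein sequence, and the GC content field can be sketched in a few lines of Python. This is an illustrative sketch, not the database's own method. It assumes that translation table 11 uses the same codon-to-amino-acid assignments as the standard code, and that an initiating GTG or TTG is read as methionine; only ATG, GTG, and TTG are treated as start codons here, which is a simplification. The helper names (`clean`, `gc_content`, `translate`) are hypothetical.

```python
from itertools import product

# Codon table in TCAG order; stop codons are marked "*".
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Simplifying assumption: only these three are treated as start codons.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the listing above."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    s = clean(dna)
    protein = []
    for i in range(0, len(s) - 2, 3):
        codon = s[i:i + 3]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an initiator codon is read as methionine
        else:
            aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last 18 bases of the gene sequence listed above.
print(translate("GTGGCGGCGT GGCCTCGG"))  # MAAWPR
print(translate("GCGGACATTT CCCCCTGA"))  # ADISP
```

Passing the full gene sequence to `translate` and `gc_content` should allow the result to be compared with the listed protein sequence and the reported GC content of 72%.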