Gene Franean1_6334 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_6334 
Symbol 
ID5674652 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp7693220 
End bp7694497 
Gene Length1278 bp 
Protein Length425 aa 
Translation table11 
GC content72% 
IMG OID641245186 
Producttransposase IS4 family protein 
Protein accessionYP_001510581 
Protein GI158318073 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCGGCGT GGCCTCGGTG CCGCTGGGTG GTTTGGTCAC GGTTCGTGGC TTCGATGGTG 
ACCCCGGACC AGGTGTCGGT CGGGGTGCTG GTGACGGCGG TGCCGCGTGA CGCGGTCGAC
GAGGCCGTCG CGGCCTGTGG GGTGGGTGCG CGGCGGGCGG GCGGGAAGCT CCCACCGCAT
GTGACGGCGT ACCTGACGTT GGCGATGTCC CTGTTTCCGG ACGACGACTA CGCCGAGGTC
GCCCAGAAGG TGACCGGGTC GCTGGACCGG TTCGGCTGCT GGGACGCGGC GTGGGCGCCG
CCGAGCGCGA GCGGGATCAC CCAGGCGCGT AAGCGGCTGG GCCGGATGGT GATGGCCGAG
GTGTTCGAGC GGGTCGCGGG CCAGGTCGCG ACACTGTCGA CGCGTGGCGC GTGGCTGCGG
GGCCGGTTGT TGCTCGCGAT CGACGGGTTT GACGTCGACG TGCCCGACAC CGAGGAGAAC
GCGGCCGAGT TCGGCTACGC CGGCACCGGG GAGAAGCGGT CGGCGTTCCC GAAGATCCGG
GTCGTCGCGT TGGCGGAGTG CGGGACGCAC GCGTTCCGGG CCGCCGAGGT CGGTGGCTGG
GCGGCTGGGG AGAGGACGCT GGCCCGCGGG CTGCTGATGC GGCTGAACCG CGACGAGGTG
CTGACCGCCG ACCGTGGGTT CTACTCGTTC GACAACTGGG CGCTGGCCGC GGGCACCGGC
GCCGACCTGA TCTGGCGGGC CCCGACCGGG CTGAACCTGC CGGTCGTGCG GGTCCTGTCC
GATGGCACGT TCCTCACCGT CCTGATCAAC CCGGAGATCA CGGGAGGTCG GCGCCGCGAG
CGGCTGCTCG CCGCCGCGAA GGCCGGCGAC GAGCTTGATC CGGACGAGGC GCACCTGGCC
CGGGTCGTCG AGTACGACAT CCCCGACCGG GCCGGTAACG GTACCGGCGA ACTGGTCGTC
GTGCTGACCA CGATCCTCGA CCCGCGTCAG GCCCGTGCCG ACGAGGTCGC CGCCGGATAC
AACGAGCGCT GGGAGGAGGA AACCGCGAAC GACCAGCTCA AGACCCATCT ACGCGGCCCC
GGGAGAGTCC TGCGCTCCCG GCTGCCGGAC CTGGCGGTCC AGGAGATGTG GGCCTGGCTG
ATCGTCCAGT ACGCGCTCAC CGCCCTGATC GCCGGCGCCG CGGAGGCCGC CGAGATCGAC
CCCGACCGGG TCGGTTTCGC CCGGACACTG CGCCTGGTCC GCCGTTCCGC CACCGGAACG
GCGGACATTT CCCCCTGA
 
Protein sequence
MAAWPRCRWV VWSRFVASMV TPDQVSVGVL VTAVPRDAVD EAVAACGVGA RRAGGKLPPH 
VTAYLTLAMS LFPDDDYAEV AQKVTGSLDR FGCWDAAWAP PSASGITQAR KRLGRMVMAE
VFERVAGQVA TLSTRGAWLR GRLLLAIDGF DVDVPDTEEN AAEFGYAGTG EKRSAFPKIR
VVALAECGTH AFRAAEVGGW AAGERTLARG LLMRLNRDEV LTADRGFYSF DNWALAAGTG
ADLIWRAPTG LNLPVVRVLS DGTFLTVLIN PEITGGRRRE RLLAAAKAGD ELDPDEAHLA
RVVEYDIPDR AGNGTGELVV VLTTILDPRQ ARADEVAAGY NERWEEETAN DQLKTHLRGP
GRVLRSRLPD LAVQEMWAWL IVQYALTALI AGAAEAAEID PDRVGFARTL RLVRRSATGT
ADISP