Gene Franean1_2810 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_2810 
Symbol 
ID5671199 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp3325511 
End bp3326698 
Gene Length1188 bp 
Protein Length395 aa 
Translation table11 
GC content63% 
IMG OID641241719 
Producttransposase IS4 family protein 
Protein accessionYP_001507139 
Protein GI158314631 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCCATCAT GCCAGCAATC TCCCGCCGTG TCGAACCTTC GGATCAGCGG TGAGGGGGAC 
GACATCTCCG GTCTGCTCGC GATGCTAGGC GGGATCACCG ACCCACGGAA AGCGCGCGGG
AAGATCTACA GCCTTTCGTT CATGCTGGCC TCCGCGCTTG TCGCGACGTT GGCTGGGGCG
ACAGGCCTGC GTGAGATCGG CAGCCGGGTC GCTGACTTCG GCCAAGATCT CCTGGCCCGG
CTCGGTGCAC CGTTCGACCA TTTCACTGGA AGATATAGGG CGCCCAGCGA AAAGGCGATC
CGCGCTCTGT TCGAGAAGAT GGACGTCGCG GCCGTCGACG CCGCCTTCGG TGCCTGGTTG
TTCGCGCACG CGGTCTGGGA GCCGGGTGAG GACATCGTCC TCGCCATGGA CGCGAAGGTC
CTGCGTGGAG CCTGGTCCGA GGGGAACAAG CAGGTCACGC TCCTGTCTGC CATGGTCCAC
GCGAATGGGC TCGTCGCCGG GCAGGTCCGC GTCCCCGACG GCACGAACGA GATCACCCAG
GTAGCGGCTC TCCTGGAGAA CCTCCCGGAC ATCTCCGGGC CTGTCGTCGC GACGTTGGAT
GCCGTGCACA CCCAGCACGA AACCGCCTTC CTCCTTGTCG AACACGGAAT CGACTACGCG
CTGACCGTGA AGGGAAACCA GCCGACGCTC TACCGGAAAA CCTTCGAGCA AACCCTTCCC
CTTCTCCAAA AACCCCCACA GCACGAAGTC GAGGAACGCG GCCACGGCCG AATCAAGAAA
TGGCAGGCCT GGACCACCGA AGCCAAGGGG ATCGGGTTTC CGGAGGTCGC GACCGCTGCC
GTCATCCGTC GCGACGAATT CGACCTCAAG GGAATCCGTG TCAGCCGTGA ATACGCTCAC
ATTCTCACCA GCGTCGCCGG CAACCGCGCC ACTGCCGCCT ACATTCACAG GCTCATCCGC
GGGCATTGGG GCATCGAGAA CGAAATTCAC TACCCGCGCG ACACCGCGTG GCGCGAGGAC
GCCAACCAGA CCCGCACAGG AAATAGTCCA CACACGCTCG CCAGCTTCCG TAACCTGGCG
ATCGGGATCA TCCGCCGAAA CGGCATCAGG AAGATAAAGG AAACCCTCGA GTACATAGCC
GGCGACCGCG ACCGGGTACT CCCACTCCTC GCTACGGCAT GTCACTAA
 
Protein sequence
MPSCQQSPAV SNLRISGEGD DISGLLAMLG GITDPRKARG KIYSLSFMLA SALVATLAGA 
TGLREIGSRV ADFGQDLLAR LGAPFDHFTG RYRAPSEKAI RALFEKMDVA AVDAAFGAWL
FAHAVWEPGE DIVLAMDAKV LRGAWSEGNK QVTLLSAMVH ANGLVAGQVR VPDGTNEITQ
VAALLENLPD ISGPVVATLD AVHTQHETAF LLVEHGIDYA LTVKGNQPTL YRKTFEQTLP
LLQKPPQHEV EERGHGRIKK WQAWTTEAKG IGFPEVATAA VIRRDEFDLK GIRVSREYAH
ILTSVAGNRA TAAYIHRLIR GHWGIENEIH YPRDTAWRED ANQTRTGNSP HTLASFRNLA
IGIIRRNGIR KIKETLEYIA GDRDRVLPLL ATACH