Gene Franean1_4467 details


Gene Information

Locus tag: Franean1_4467
Symbol:
ID: 5672818
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 5332965
End bp: 5334365
Gene Length: 1401 bp
Protein Length: 466 aa
Translation table: 11
GC content: 73%
IMG OID: 641243335
Product: integrase catalytic region
Protein accession: YP_001508751
Protein GI: 158316243
COG category: [L] Replication, recombination and repair
COG ID: [COG2826] Transposase and inactivated derivatives, IS30 family
TIGRFAM ID:
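
The two length fields can be cross-checked against the coordinates. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions and that the CDS ends in a single stop codon; the helper names `gene_length` and `protein_length` are hypothetical and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(5332965, 5334365)
print(length_bp, protein_length(length_bp))  # 1401 466
```

Under these assumptions the result agrees with the Gene Length and Protein Length fields listed above.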


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGACGGC CGGCGGATTG GACGCAGACG GTCACGGGGC GGGCGGGGAT GTGCTCGCCG 
GGGCGGCCGC CGGTGGCGCG GTGGGAGCAT CAGCAGCGGT TCTGGGCCGC GGTCGCGCGT
GGGCTGAGCA GCGAGGATGC CGGTGTCGCG GCCGGCGTGT CGCCGGCGGT CGGGACCCGG
TGGTTCCGCA ACTGTGGCGG GATGCCCCCT TCGGACTTCC CCGCCCCGTC GGGCCGATAC
CTGTCGTTCG CGGAGCGGGA GGAGATCGCC CTGGGTCGTG CCCGCGGGGA CAGCATCCGC
CGGATCGCGC GGGGTTTGGG CCGGCCTGCG TCGACGGTGT CACGGGAGCT GCGGCGGAAC
GCCGGTACCC GCGGCGGGAC CCTGGTCTAC CGGGCCACGC TCGCGCAGTG GCATCGAGAC
CGTCGGGCTG CCCGGCCCAA GACTGCCAAG CTCGCAGGCA ACGAGCGGCT GCGTGGCTAT
GTGCAGGACC GGCTCGCCGG GCCGGTCCTG CGCGGCGACG GCACGGTGGT GCCCGGCCCG
TGGACGGCGC CGTTCACCGG CAGGAACAAG CCGCACCGCC AGGACCGCCG TTGGGCCAGC
GCGTGGAGCC CGGAGCAGAT CTCGCGGCGG CTGCGGGTCG ATTTTCCCGA CGACCCGGCG
ATGCGGATCT CGCATGAGGC GATCTACCAG GCCCTGTACA TCGAGAGGCG GGGGGCGTTG
CGCCGTGAGC TGGTCGCGTG CCTGCGTACG GGCCGCGCCC TGCGGGTGCC ACGCGCGCGG
GCCGGACGCC GCCCCGACGG CATGGTCACC CCCGAGGTGC GGATCGGAGC CCGGCCTGTT
GAGGCCACAG ACCGGGCGGT CGCGGGGCAC TGGGAAGGCG ACCTGATCAT CGGGTTGAAC
CGGTCCGCGA TCGGCACGCT GGTCGAGCGC ACCACCCGGC TCACGGTCCT GCTGCACCTG
CCCCGCATGG ACGGCTACGG CCATGAACCA CGGGTGAAGA ACGGTCCGGC GTTGGCAGGC
CGCGGCGCCG ACGCTGTCCG GGACGCGATC ACACGAGCGT TCGCGGAGCT GCCCGAGCAG
CTACGGCGGA CCCTGACCTG GGACCGCGGC AAGGAGATGG CCGGACACGC CGCGCTGACC
GCCGACACGG GCCTGGGGGT CTACTTCGCC GACCCGCACA GCCCCTGGCA GCGCGGCACG
AACGAGAACA CCAACGGGCT GCTACGCCAG TACTTCCCCA AGGGCACCGA CCTGTCCCGC
TGGACCCGCC ACGAACTCGC CACCATCGCC GCGACCCTCA ACGACCGGCC CCGCAAGACC
CTCGACTGGA AGACCCCCAC CGAAGCGATG AACAACCAGC TACTCTCACT TCAACAACCC
GGTGTTGCGA GGACCGGTTG A
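
The gene sequence is printed in space-separated blocks of 10 bases, so it needs to be joined before any composition statistic such as GC content is computed. The following is a minimal sketch of one way to do that; the helper names `clean_sequence` and `gc_content` are hypothetical, and only the first line of the sequence is used here for illustration.

```python
def clean_sequence(block: str) -> str:
    """Join a whitespace-separated sequence block into one uppercase string."""
    return "".join(block.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence only; pass the full block for the whole gene.
first_line = "ATGGGACGGC CGGCGGATTG GACGCAGACG GTCACGGGGC GGGCGGGGAT GTGCTCGCCG"
print(f"{gc_content(clean_sequence(first_line)):.0%}")
```

Running the same two functions on the complete sequence block gives a value to compare with the GC content field of the record.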
 
Protein sequence
MGRPADWTQT VTGRAGMCSP GRPPVARWEH QQRFWAAVAR GLSSEDAGVA AGVSPAVGTR 
WFRNCGGMPP SDFPAPSGRY LSFAEREEIA LGRARGDSIR RIARGLGRPA STVSRELRRN
AGTRGGTLVY RATLAQWHRD RRAARPKTAK LAGNERLRGY VQDRLAGPVL RGDGTVVPGP
WTAPFTGRNK PHRQDRRWAS AWSPEQISRR LRVDFPDDPA MRISHEAIYQ ALYIERRGAL
RRELVACLRT GRALRVPRAR AGRRPDGMVT PEVRIGARPV EATDRAVAGH WEGDLIIGLN
RSAIGTLVER TTRLTVLLHL PRMDGYGHEP RVKNGPALAG RGADAVRDAI TRAFAELPEQ
LRRTLTWDRG KEMAGHAALT ADTGLGVYFA DPHSPWQRGT NENTNGLLRQ YFPKGTDLSR
WTRHELATIA ATLNDRPRKT LDWKTPTEAM NNQLLSLQQP GVARTG
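
The protein sequence can be checked against the gene sequence by translating it codon by codon. The following is a minimal sketch, assuming the sequence is given in the coding orientation and using the amino acid assignments of NCBI translation table 11, which are the same as those of the standard code (the two tables differ in their permitted start codons). The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codons in TCAG order paired with their amino acids (NCBI table 11 assignments).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(block: str) -> str:
    """Translate a whitespace-separated DNA block, stopping at the first stop codon."""
    seq = "".join(block.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unrecognised codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 21 bases of the gene sequence in this record.
print(translate("ATGGGACGGC CGGCGGATTG GACGCAGACG"))  # MGRPADWTQT
print(translate("GGTGTTGCGA GGACCGGTTG A"))           # GVARTG
```

The two fragments reproduce the first ten and last six residues of the protein sequence shown above; passing the full gene sequence block to `translate` would allow the same comparison over the whole protein.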