Gene Franean1_2878 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_2878 
Symbol 
ID5671267 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp3392563 
End bp3393654 
Gene Length1092 bp 
Protein Length363 aa 
Translation table11 
GC content65% 
IMG OID641241787 
Productintegrase catalytic region 
Protein accessionYP_001507207 
Protein GI158314699 
COG category[L] Replication, recombination and repair 
COG ID[COG3415] Transposase and inactivated derivatives 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.318779 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.565857 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCCTAA CTGATAATCA ACGCAATATT CTCGAGTCCT TGGCTGGAGG TGGTGGGTAC 
GACAGGTCTG CGGCTGCCCG CGCCCGTATG GTGCTATGGC GGGACGAAGG ATTCTCAGTG
CGGGAAATAG CCGAGAAGGC GGGCGCGTCG AAGCCTACCG TGCGACTGTG GCTGTCGCGC
TATGACGAGG AGGGGCCGGA CGGCTTGCTG AGCCGGGTGT CCCCGGGGCG GCCACGGGAG
GTCCCGGGGC GGGTACGGGC GCGGATCCTG GCGTTGACCA GGACCACTCC TCCACCGGAG
ACCGGACTGA GCCACTGGAC GAGCACCGAG ATGGCGCGGT ACCTGAAGCG CCGCGAAGGA
GTGTCGGTCT CGCACACCTT CGTGGCCCAG CTGTGGCGGG AGAACAATCT CCAGCCGCAC
CGGCACCGAG TCTTCAAGCT CTCGGCGGAC CCGGATTTCG AGGCCAAGGT GGAGGACGTC
GTCGGCCTCT ACCTTGATCC CCCCGAGGGC GCCGAGGTCC TGTCGATCGA CGAAAAGCCT
GGGGTGCAGG CACGCGACCG GACGCAGCCA CCGCGGCCGG TCGCCTCCGG CCGGGTCGCC
ACCCGCACGC ACGACTACCA GCGGAAGGGC ACGACCGACC TGTTCGCCGC CCTCGACGTC
GGGACGGGGC GGGTCACCGC CAGGTGCTTC CCCAGCCACA CCAGGGCCGA TTTCCTCACG
TTCATGGACC AGGTCATCGC GGAATACGGC GGTGCGGAGC TCCATGTCGT GGTCGACAAT
CTGGCCACCC ACTACGGCCC CGACGTCGAC ACATGGCTAC GCAGACACAA GAACGTCACG
TTCCATTTCA CCCCGTCCGG CAGTTCATGG CTCAACCAGG TCGAGAACTG GTTCGGTATT
CTCACCCGGA ACGCACTCCA GCGCGGGGCG TTCGTCTCGG TCCAGGACCT CGTCAACACC
ATCAACAACT ATGTCAAGAA CTGGAACTGG GACGCCCATC CGTTCGAGTG GACAGCCACC
GCAGAAGAGA TCGTAGCCAA GGTGGAGGTA CTCCACCGGG AATTCAGGAA GCTGCTCGCC
AACAACTTGT GA
 
Protein sequence
MILTDNQRNI LESLAGGGGY DRSAAARARM VLWRDEGFSV REIAEKAGAS KPTVRLWLSR 
YDEEGPDGLL SRVSPGRPRE VPGRVRARIL ALTRTTPPPE TGLSHWTSTE MARYLKRREG
VSVSHTFVAQ LWRENNLQPH RHRVFKLSAD PDFEAKVEDV VGLYLDPPEG AEVLSIDEKP
GVQARDRTQP PRPVASGRVA TRTHDYQRKG TTDLFAALDV GTGRVTARCF PSHTRADFLT
FMDQVIAEYG GAELHVVVDN LATHYGPDVD TWLRRHKNVT FHFTPSGSSW LNQVENWFGI
LTRNALQRGA FVSVQDLVNT INNYVKNWNW DAHPFEWTAT AEEIVAKVEV LHREFRKLLA
NNL