Gene Franean1_3580 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3580 
Symbol 
ID5671949 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp4240519 
End bp4241610 
Gene Length1092 bp 
Protein Length363 aa 
Translation table11 
GC content65% 
IMG OID641242466 
Productintegrase catalytic region 
Protein accessionYP_001507886 
Protein GI158315378 
COG category[L] Replication, recombination and repair 
COG ID[COG3415] Transposase and inactivated derivatives 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.364035 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCCTAA CTGATAATCA ACGCAATATT CTCGAGTCCT TGGCTGGAGG TGGTGGGTAC 
GACAGGTCTG CGGCTGCCCG CGCCCGTATG GTGCTATGGC GGGACGAAGG ATTCTCAGTG
CGGGAAATAG CCGAGAAGGC GGGCGCGTCG AAGCCTACCG TGCGACTGTG GCTGTCGCGC
TATGACGAGG AGGGGCCGGA CGGCTTGCTG AGCCGGGTGT CCCCGGGGCG GCCACGGGAG
GTCCCGGGGC GGGTACGGGC GCGGATCCTG GCGTTGACCA GGACCACTCC TCCACCGGAG
ACCGGACTGA GCCACTGGAC GAGCACCGAG ATGGCGCGGT ACCTGAAGCG CCGCGAAGGA
GTGTCGGTCT CGCACACCTT CGTGGCCCAG CTGTGGCGGG AGAACAATCT CCAGCCGCAC
CGGCACCGAG TCTTCAAGCT CTCGGCGGAC CCGGATTTCG AGGCCAAGGT GGAGGACGTC
GTCGGCCTCT ACCTTGATCC CCCCGAGGGC GCCGAGGTCC TGTCGATCGA CGAAAAGCCT
GGGGTGCAGG CACGCGACCG GACGCAGCCA CCGCGGCCGG TCGCCTCCGG CCGGGTCGCC
ACCCGCACGC ACGACTACCA GCGGAAGGGC ACGACCGACC TGTTCGCCGC CCTCGACGTC
GGGACGGGGC GGGTCACCGC CAGGTGCTTC CCCAGCCACA CCAGGGCCGA TTTCCTCACG
TTCATGGACC AGGTCATCGC GGAATACGGC GGTGCGGAGC TCCATGTCGT GGTCGACAAT
CTGGCCACCC ACTACGGCCC CGACGTCGAC ACATGGCTAC GCAGACACAA GAACGTCACG
TTCCATTTCA CCCCGTCCGG CAGTTCATGG CTCAACCAGG TCGAGAACTG GTTCGGTATT
CTCACCCGGA ACGCACTCCA GCGCGGGGCG TTCGTCTCGG TCCAGGACCT CGTCAACACC
ATCAACAACT ATGTCAAGAA CTGGAACTGG GACGCCCATC CGTTCGAGTG GACAGCCACC
GCAGAAGAGA TCGTAGCCAA GGTGGAGGTA CTCCACCGGG AATTCAGGAA GCTGCTCGCC
AACAACTTGT GA
 
Protein sequence
MILTDNQRNI LESLAGGGGY DRSAAARARM VLWRDEGFSV REIAEKAGAS KPTVRLWLSR 
YDEEGPDGLL SRVSPGRPRE VPGRVRARIL ALTRTTPPPE TGLSHWTSTE MARYLKRREG
VSVSHTFVAQ LWRENNLQPH RHRVFKLSAD PDFEAKVEDV VGLYLDPPEG AEVLSIDEKP
GVQARDRTQP PRPVASGRVA TRTHDYQRKG TTDLFAALDV GTGRVTARCF PSHTRADFLT
FMDQVIAEYG GAELHVVVDN LATHYGPDVD TWLRRHKNVT FHFTPSGSSW LNQVENWFGI
LTRNALQRGA FVSVQDLVNT INNYVKNWNW DAHPFEWTAT AEEIVAKVEV LHREFRKLLA
NNL