Gene Franean1_2179 details

Gene Information

Locus tag: Franean1_2179
Symbol:
ID: 5670579
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 2613266
End bp: 2614357
Gene Length: 1092 bp
Protein Length: 363 aa
Translation table: 11
GC content: 65%
IMG OID: 641241100
Product: integrase catalytic region
Protein accession: YP_001506521
Protein GI: 158314013
COG category: [L] Replication, recombination and repair
COG ID: [COG3415] Transposase and inactivated derivatives
TIGRFAM ID:
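
The gene and protein lengths listed above follow from the start and end coordinates. The sketch below shows that arithmetic. It assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The terminal stop codon (assumed present) encodes no amino acid.
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(2613266, 2614357)
print(length, "bp")                      # 1092 bp
print(protein_length_aa(length), "aa")   # 363 aa
```

Both results agree with the Gene Length and Protein Length fields of this record.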


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.376991
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCCTAA CTGATAATCA ACGCAATATT CTCGAGTCCT TGGCTGGAGG CGGTGGGTAC 
GACAGGTCTG CGGCTGCCCG CGCCCGTATG GTGCTATGGC GGGACGAAGG ATTCTCAGTG
CGGGAAATAG CCGAGAAGGC GGGCGCGTCG AAGCCTACCG TGCGACTGTG GCTGTCGCGC
TATGACGAGG AGGGGCCGGA CGGCTTGCTG AGCCGGGTGT CCCCGGGGCG GCCACGGGAG
GTCCCGGGGC GGGTACGGGC GCGGATCCTG GCGTTGACCA GGACCACCCC TCCACCGGAG
ACCGGACTGA GCCACTGGAC GAGCACCGAG ATGGCGCGGT ACCTGAAGCG CCGCGAAGGA
GTGTCGGTCT CGCACACCTT CGTGGCCCAG CTGTGGCGGG AGAACAATCT CCAGCCGCAC
CGGCACCGAG TCTTCAAGCT CTCGGCGGAC CCGGATTTCG AGGCCAAGGT GGAGGACGTC
GTCGGCCTCT ACCTTGATCC CCCCGAGGGC GCCGAGGTCC TGTCGATCGA CGAAAAGCCT
GGGGTGCAGG CACGCGACCG GACGCAGCCA CCGCGGCCGG TCGCCTCCGG CCGGGTCGCC
ACCCGCACGC ACGACTACCA GCGGAAGGGC ACGACCGACC TGTTCGCCGC CCTCGACGTC
GGGACGGGGC GGGTCACCGC CAGGTGCTTC CCCAGCCACA CCAGGGCCGA TTTCCTCACG
TTCATGGACC AGGTCATCGC GGAATACGGC GGTGCGGAGC TCCATGTCGT GGTCGACAAT
CTGGCCACCC ACTACGGCCC CGACGTCGAC ACATGGCTAC GCAGACACAA GAACGTCACG
TTCCATTTCA CCCCGTCCGG CAGTTCATGG CTCAACCAGG TCGAGAACTG GTTCGGTATT
CTCACCCGGC ACGCACTCCA GCGCGGGGCG TTCGTCTCGG TTCAGGACCT CGTCAACACC
ATCAACAACT ATGTCAAGAA CTGGAACTGG GACGCCCATC CGTTCGAGTG GACAGCCACC
ACAGAAGAGA TCGTAGCCAA GGTGGAGGTA ATCCACCGGG AATTCAGGAA GCTGCTCGCC
AACAACTTGT GA
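
The GC content field can be recomputed from the gene sequence. A minimal sketch, assuming GC content means the percentage of G and C among the A, C, G and T bases, with the spaces and line breaks of the 10-base display blocks ignored; `gc_content` is a hypothetical helper name.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T characters in seq."""
    # Keep only nucleotide letters; this drops the spaces and newlines
    # used to display the sequence in blocks of ten.
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A, C, G or T")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# First display block of the gene sequence: 3 of its 10 bases are G or C.
print(gc_content("ATGATCCTAA"))  # 30.0
```

Passing the full gene sequence to the same function gives the value to compare with the GC content field.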
 
Protein sequence
MILTDNQRNI LESLAGGGGY DRSAAARARM VLWRDEGFSV REIAEKAGAS KPTVRLWLSR 
YDEEGPDGLL SRVSPGRPRE VPGRVRARIL ALTRTTPPPE TGLSHWTSTE MARYLKRREG
VSVSHTFVAQ LWRENNLQPH RHRVFKLSAD PDFEAKVEDV VGLYLDPPEG AEVLSIDEKP
GVQARDRTQP PRPVASGRVA TRTHDYQRKG TTDLFAALDV GTGRVTARCF PSHTRADFLT
FMDQVIAEYG GAELHVVVDN LATHYGPDVD TWLRRHKNVT FHFTPSGSSW LNQVENWFGI
LTRHALQRGA FVSVQDLVNT INNYVKNWNW DAHPFEWTAT TEEIVAKVEV IHREFRKLLA
NNL
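
The protein sequence is the translation of the gene sequence under translation table 11. A minimal sketch of that step follows. It relies on the fact that table 11 uses the same codon-to-amino-acid assignments as the standard code (the two differ only in which start codons are permitted); the names `CODON_TABLE` and `translate` are hypothetical, and the function raises `KeyError` on codons containing anything other than A, C, G or T.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first base varies slowest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # remove display spacing
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First six and last four codons of the gene sequence in this record.
print(translate("ATGATCCTAA CTGATAAT"))  # MILTDN
print(translate("AACAACTTGT GA"))        # NNL
```

The two fragments reproduce the first six residues (MILTDN) and the last three residues (NNL) of the protein sequence above.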