Gene Franean1_6721 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_6721 
Symbol 
ID5675792 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp8173310 
End bp8174794 
Gene Length1485 bp 
Protein Length494 aa 
Translation table11 
GC content64% 
IMG OID641245570 
Productintegrase family protein 
Protein accessionYP_001510961 
Protein GI158318453 
COG category[L] Replication, recombination and repair 
COG ID[COG4974] Site-specific recombinase XerD 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.917048 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGAAAGGGT CCATCTTCAA GAAATGCGGC TGCCGCGACC CTATGACGAA ACGGCAGCTA 
GGAAGAGCGT GTCCGAAACT TCGCCGTTCG AACGGCGGCT GGCGAACCGA CCATGGAACC
TGGGCCTACC GCATCGACCT GCCGCCCCAC CCTGACGGTA GGCGCCGCCT CGTCAGCCGC
AGCGGATTCC CCACCCAGGC CGAAACGCGC GCAGAATGCG AACGTATCGA AGCCCTCATC
GCCATCCCCG ACGCAGGAGA CAACACCGGC CGCCAAAGCA TCGCCGACAT CATCGGCAAC
GCCATCAGCA CCAACAAGCC CCTCCCTGAC ATCGACGACG TCAAGAACCG ATACCGGCGC
AACGCGGACC TCAACCCCGA CATCACCATC AGCGCCTGGA TGGAACGCTG GCTCGCCAGT
AGAAAGAACA TCCGACCGAA AACCCGCCTG GGCTACGAAA GTTACATCAG AGTCCACATC
ACGCCAGCCA TCGGCTCCGT ACAGCTCACC AAACTCACCG TCTCCCACCT CGACGACATG
TTCACCGCCA TCGACAACAC CAACATCAGG ATCATCCAGG ACCTTCAATC CGACGATCCC
CGAATCCGAC GCGCAGCCCG CGGCAAAAGA CCCACAGGAC CCGCCACCCA GCAGCGCATC
CGAGAAGTAC TCCGCGCCGC AATCAACGAC GCCAACCGCC GAGGACTCAT GACCCACAAC
CCAGCCAAGT ACGTCGAACT CCGCTCCGGA AAACGACCCA AAGCACTCCT GTGGACAGAT
GAACGCGTCG CGCGTTGGCG AGAAACAGGC ACCAAACCCT CACCCGTCAT GGTCTGGACC
CCGACCCAGA CCGGGATGTT CCTCGACCAT GCCCACAGCG ACCCGCTCTA CCCCGTCTAC
CACCTCATCG CTTACCGAGG CCTCCGTCGC GGCGAATCCG TCGCTGTGCA CCTGGACGAC
ATCGACATCA CCGAGGCAAC CCTCACCATC CGCTGGCAGT TCGTCCAGAT CGGCTACGCC
ACCCAACTCG CAAAGCCGAA ATCCGACGCC GGGGACCGTG TCATCTCCCT CGACCCCGAC
ACCCTCGCCG TCCTGAAAGC CTGCAGAACC CGCCAACACA CGGCCCGACT CGCCGCCGGT
ACGGCCTGGC CGAACAACGG CCTCGCGTTC ACCCACCCCG ACGGCAGCCC CATCCACCCC
GAACATCTCA CCAACCGCTT CCAGACCCTG GTCCAGGAAG CCGACCTACC ACCCATCACC
ATCCATGGCC TGCGCCACGG CGCCGCCACC CTCGCCCTCG CCGCCGGAGC CGACCTCAAA
GCAGTCCAAG AGCTCCTCGG CCACTCCACC ATCATGCTCA CCGCCGACAC CTACACCCAG
ATCCTCCCCG ATCTCGCCGC CGAGATCGCC CGCAACACCG CCCGCCTCAT CCCCCGCACC
CGCAGCCCCC ACGACTACCC CCGACACCAC ACCACGACCG ACTAA
 
Protein sequence
MKGSIFKKCG CRDPMTKRQL GRACPKLRRS NGGWRTDHGT WAYRIDLPPH PDGRRRLVSR 
SGFPTQAETR AECERIEALI AIPDAGDNTG RQSIADIIGN AISTNKPLPD IDDVKNRYRR
NADLNPDITI SAWMERWLAS RKNIRPKTRL GYESYIRVHI TPAIGSVQLT KLTVSHLDDM
FTAIDNTNIR IIQDLQSDDP RIRRAARGKR PTGPATQQRI REVLRAAIND ANRRGLMTHN
PAKYVELRSG KRPKALLWTD ERVARWRETG TKPSPVMVWT PTQTGMFLDH AHSDPLYPVY
HLIAYRGLRR GESVAVHLDD IDITEATLTI RWQFVQIGYA TQLAKPKSDA GDRVISLDPD
TLAVLKACRT RQHTARLAAG TAWPNNGLAF THPDGSPIHP EHLTNRFQTL VQEADLPPIT
IHGLRHGAAT LALAAGADLK AVQELLGHST IMLTADTYTQ ILPDLAAEIA RNTARLIPRT
RSPHDYPRHH TTTD