Gene Franean1_6303 details

Gene Information

Locus tag: Franean1_6303
Symbol:
ID: 5675784
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 7659082
End bp: 7660332
Gene Length: 1251 bp
Protein Length: 416 aa
Translation table: 11
GC content: 68%
IMG OID: 641245156
Product: integrase family protein
Protein accession: YP_001510551
Protein GI: 158318043
COG category: [L] Replication, recombination and repair
COG ID: [COG4974] Site-specific recombinase XerD
TIGRFAM ID:
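
The coordinate and length fields in this table can be cross-checked against each other. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon; the record states neither, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS with one terminal stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(7659082, 7660332)
print(length, protein_length(length))  # 1251 416
```

Under those assumptions the result agrees with the Gene Length (1251 bp) and Protein Length (416 aa) fields.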


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.66477
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGGGC AGGACCCTCG CGGTGACGAG GACACCCCGG ACGAGTCGAC CGGCAGGAAG 
AAGAAGCAGA ACCGCCGGGC GCAGGGCGAG GGCTCGGTGT ACTGGCGTGA GGACCGTCAG
CGTTGGGTCA TCGAGATCGA CTACGGAGTG GTGAACGGCC GGCGCAAGCG CGTGCCGCGC
TACTTCCGGA CGCAGGAAGA GGCGATCGAG GAGCAGCGGA AGGCGCGGCA GAGCAAGGCG
GACGGGCTGA CCACCCTCGA CCGGCGGTCG CGGTTCGCGG ACTTCCTGAC GTACTGGCTG
GACGAGATCG TCGACCCGTC TGAACGGGCG GAGTCCACGA AGTCGAACTA CCGCGTCATG
GTGAACAACC ACATCCGCCC GGCGCTCGGC TCGCGCCGGC TCGTCGAACT GAAGCACGAG
GATCTTCAGC GGTTCCTGAA CCGCAAGGCG GCGGATGGGT ACAGCACGTC GACCATGCGC
ACGCTGCGTT CCGTCCTGCG CCAAGCGCTC AACGAAGCGG TCATCACCGA GAAGATCAGT
CGCAATGTCG CCGAGACGCT GCGGGTCCCG AAAGCGCGGA AACCGAAGCG GAATGTGGCC
GCGCTGAGCA GGGACGACGG GCTGCGGCTG CTCGCCGAAG CGAAGTCCAC CCGGCATTAC
GCGCTGTATG TGCTGCTGGC GATGGTCGGT CTACGCCGTG GGGAAGCGCT CGCGTTGCGC
TGGTCCGACT TCGACGAATC GGCAGGCACG CTGCGGGTGG TACGCCAGGT GACCCGGGTG
AGCGGCGTGA AGGGGCTGGT CGTCGGCCCG ACGAAGAGTC AGGCCGGAAC ACGCACGCTC
ACGCTGCCGA CCCGATGCGT CCGGGTGCTC CAGGCACACC GCACCGCCCA GCACGCCCAC
CGGCAGGCCG CGGGGAAGCG GTGGAAGGAG AACGGGCTGA TCTTCCCGAG TACCGTCGGC
ACGCACATGG AGCCGCGCGG GCTGAACACC CACCTGTCCA AGCTGTGCCA GCGTGCCGGG
CTGCCGCACC TCGGCCCGCA CGCGCTGCGG CACACCGCGG CCACGATGGC CTACGCGCTC
GGTGTCGACT GGAAGCAGAT ACAGCAGATG CTCGGCCACA CGATGCTGTC GACCACGATG
GACATCTACG TGGACCTGGT CGACAGCGTC CACCGCGACG CGGCGTCCAA ACTCGACGCG
TGGTTCGACG AACCTGATGA AGACGGTGGG CTCAGCCCAG CACAGCGGTA G
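
The gene sequence is printed in space-separated blocks of ten bases, so the whitespace has to be removed before recomputing a field such as GC content. A minimal sketch follows; the helper names are hypothetical, and how the database rounds its reported percentage is not stated in this record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First two blocks of the gene sequence, used here only as a short example.
fragment = clean_sequence("ATGACCGGGC AGGACCCTCG")
print(len(fragment), round(100 * gc_fraction(fragment)))
```

Running the same two functions on the full sequence gives a value that can be compared with the GC content field.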
 
Protein sequence
MTGQDPRGDE DTPDESTGRK KKQNRRAQGE GSVYWREDRQ RWVIEIDYGV VNGRRKRVPR 
YFRTQEEAIE EQRKARQSKA DGLTTLDRRS RFADFLTYWL DEIVDPSERA ESTKSNYRVM
VNNHIRPALG SRRLVELKHE DLQRFLNRKA ADGYSTSTMR TLRSVLRQAL NEAVITEKIS
RNVAETLRVP KARKPKRNVA ALSRDDGLRL LAEAKSTRHY ALYVLLAMVG LRRGEALALR
WSDFDESAGT LRVVRQVTRV SGVKGLVVGP TKSQAGTRTL TLPTRCVRVL QAHRTAQHAH
RQAAGKRWKE NGLIFPSTVG THMEPRGLNT HLSKLCQRAG LPHLGPHALR HTAATMAYAL
GVDWKQIQQM LGHTMLSTTM DIYVDLVDSV HRDAASKLDA WFDEPDEDGG LSPAQR
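
The protein sequence can be reproduced from the gene sequence by codon translation. The sketch below is an illustration, not the database's own method: it builds the standard codon table by hand. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs in the permitted start codons, which does not matter here because this gene starts with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop block spacing and line breaks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # 'X' for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons of the gene sequence.
print(translate("ATGACCGGGC AGGACCCTCG C"))  # MTGQDPR
```

The output matches the first seven residues of the protein sequence; the final codons of the gene, CAG CGG TAG, translate to QR followed by a stop, matching its last two residues.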