Gene Francci3_1379 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_1379 
Symbol 
ID3906581 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp1656509 
End bp1658032 
Gene Length1524 bp 
Protein Length507 aa 
Translation table11 
GC content71% 
IMG OID637878716 
ProductXRE family transcriptional regulator 
Protein accessionYP_480485 
Protein GI86740085 
COG category 
COG ID 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.464445 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0167147 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGCTCA CGACCGGTCC TTCGGCAGTT CCCCGCGCAC CGGTGGCCTC CGCGCCAACA 
CCTGAGGTAT CCGGTGGTGA CAGGACGCCC ACCACCCGCA CACCCAACAC CCGCACACCC
AACACCCGGC TGCGACTGAG TCGGGAGGAA CGGGGCTGGT CGCAGGAACG GCTCGCCACC
GAGATCCGTC GGTTCTCCGT CATCCACGAG GGTCGCGAGG CCGGGGTGAC CGGCAACATG
ATCTGCAAGT GGGAGAAGGG CGACAAGAAA CCAAGCCTGC GTTATCAGCG GTTGTTGCGG
GCGCTATTCG GCCGGTCGTC GGCGGAGCTC GGTTTCGTGG ATGACGAAGC GCTTCCCGCC
GCCGTCGGAG CACTGCGGGG CGGAACCGGT CAGGGTACGG CCGTCGCCGC GTCGATGAAC
CTTGAACAGG CGGCCTGCGA GACGGGCGCC GCGGGCACAC TCGTACTTCA CCCCGGCACC
GACATGATCG ACACGAACGG TATTCCCGTG GAACGTCGGG ACTTTCTCCG GCTCTTCGCC
GCGGCGGGCG GCGTCGCCGT CGTGCCCATG AACGGCGTCA CGGACACCCC GCCCTGGGAG
CGCCTCTCGG CGGCGCTGCG TCGCCGTAGC ACGGTGACCC CGGAGCTCGT CAACGAGCTG
GGCCAGCTCA CCGCGGGGCT GTACGGCTTA GAGGAACGCG TTCCCGCCCG GATCCTGATA
CAGCGGGTCA CCGGGCATCT GAGCAGGCTC AGTCAGCTGC TGGAGACCAC GAGCCGGTCC
CCGGTGCGCC GTGAGCTGAC CTCCACCGCC GGGGAGACGG CGGCGCTCGC CGGATGGCTC
TCGTTCGACA TGAACGACCT GCCCGCGGCC GTCGCCTACT ACCGGGTGGC CATCGAGGCC
GCCCGCGAGG CCGATGACCA CGCCCTGTGG GCCTGCGTAC TCGGTTACGA GAGCTACCAG
ACCGCCAGCC GTGGCCGCCA CGACCAGGCC TGCGCGCTGC TCGGTGAAGC CCAGCGGCGA
GCCGCCACGG GCAGCACGGT GATGACCAGG GCCTGGCTGG CTGCCCGGGA GGCCGAGGAG
CAGGCCGCTC GCGGCGAGGG CCGGGCGGCG CTGGCCGCCC TCGACCGTGC CCAGGATGCC
TTCGACCGGG GTGAGAGCGA CGGGGACCGG GTCTGGACGC AGTTCTTCGA CCGGGGCCGG
CTCGATGGCC TGAAGGTCAC CACCTACACG CGGCTACGGC GACCCGCCGC GGCCTACGCC
GCCGCCACCG AGGCACTGCG GGCGGCCGCT CCCGGTGCGA CGAAGAAGCG GTCGTTGCTG
ATGAGCGACA TCGCGGAGGT GCACATCCAG CGGCGGGAGA TCGAGGCGGC CTGCCACTTC
GCGGCCGAGT CCCTCGCGAT CGTGCTCCAG ACGGACTTCT CGCTGGGCCT GGCCCGTATC
CGCCGCGCCC GGGAGCACCT GCGGCCCTGG CAGCACACCC AGGCCGTGCG CGAGTTCGAC
GAGCAGCTGC GGGCACTCAC CTGA
 
Protein sequence
MQLTTGPSAV PRAPVASAPT PEVSGGDRTP TTRTPNTRTP NTRLRLSREE RGWSQERLAT 
EIRRFSVIHE GREAGVTGNM ICKWEKGDKK PSLRYQRLLR ALFGRSSAEL GFVDDEALPA
AVGALRGGTG QGTAVAASMN LEQAACETGA AGTLVLHPGT DMIDTNGIPV ERRDFLRLFA
AAGGVAVVPM NGVTDTPPWE RLSAALRRRS TVTPELVNEL GQLTAGLYGL EERVPARILI
QRVTGHLSRL SQLLETTSRS PVRRELTSTA GETAALAGWL SFDMNDLPAA VAYYRVAIEA
AREADDHALW ACVLGYESYQ TASRGRHDQA CALLGEAQRR AATGSTVMTR AWLAAREAEE
QAARGEGRAA LAALDRAQDA FDRGESDGDR VWTQFFDRGR LDGLKVTTYT RLRRPAAAYA
AATEALRAAA PGATKKRSLL MSDIAEVHIQ RREIEAACHF AAESLAIVLQ TDFSLGLARI
RRAREHLRPW QHTQAVREFD EQLRALT