Gene Francci3_2840 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_2840 
Symbol 
ID3904752 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp3344701 
End bp3346926 
Gene Length2226 bp 
Protein Length741 aa 
Translation table11 
GC content71% 
IMG OID637880161 
ProductAAA_5 ATPase associated with various cellular activities 
Protein accessionYP_481927 
Protein GI86741527 
COG category[V] Defense mechanisms 
COG ID[COG1401] GTPase subunit of restriction endonuclease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.241934 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGGATC AGCTAGAGAT CATCGGCGGG CTGCTGTACC GGGTTATGCG GATCTTGGCG 
GAGGTCGGCG AGGCGTCTCC GGACGAGCTG TGGATGCGGA TGCGTGCGTC CGATGCTGGC
GCCGATCCCG GATGGCGGCG CGGATCAGGT GACGACCCGC GGAGCGCCGT CGCAAGGAAA
CTCGTCCTGC GGGGCGCTGT CTACCTGGCC AGGGCCGGTC ATCTGAGTGA GAGCAACCGC
CGGTGGCAGG CTACCGGAAT CGGCCGGGAT GCGTTGCGGG CGTCCCCGGA CGAGGCCGCG
TGGTGGCGTG ATGTCACGGA GCACAACACT TACTGGCGGG AACACCGGAG CAGCCTTGGC
CTGGTGGACG ACGTCCTCGC GGTGCTGCCC GAACAGACCT GGGTCAGCGT CACGGATCTC
TCGACGGTGG CCGATCTGAC GGTCGATGGC CTCGTCCGCC ATCTGTGCGG CTTCCGCCCC
GAGGGCTGGT CGCGCGTACT CGACCCGGCG GGCCGGCCGC CGCGCGAGGC CCTCTTCACC
GAGGACGAGT ATCGCGACTG GCTGGACTGG TTCGAGGTCG AGGTCGGCGA CCTGACGGCG
GGGCGGGCTG CCGGCGAGCT GCGGCTGCCG CTCGACGACC TCCGCATGCT CGTCGAGTCC
ATCGCGCCGG AGCGGATCCC TCGCCGGGCC TGGCTGGTGC GCGGCGCGGA CCGGCGTGGC
CGTTCGCTGG CGCGGGATCT CTGGCTGGCC GACGAGGTCT GTTCCCTGCC CGGTGACCGG
CTGGAGGTGC GGCCCGGGGT CGACCGGGAG CAGGTGCGCC GGGCGGTCGA CCGCGAACAC
GCCTCGCTCA CCTCGGCCCG GCGCGGCCGG CTCACCAGCG AGTACCACGC CTTCCTGAGC
CGGATGCGGG AGAACGACCT CGTCGTCGTG AACGACCGTG ACGAGTACTA CGTCGGGGAG
ATCCTCGGAC CGCCGGTGTT CGTCGCGAGC GTCGGGGGCG TCGCGGACCT GCAACGGCCG
GTCCACTGGC GCAACGCCGA CGAGCCGGTG AACTACCTCG ATCTACCCGA TCGGGTGGCG
GCCCTGCTCG GCAACGCGGA AGCGCGCATC GTCGATCTTT CCGATTTCGT CGCCGACCTC
GACGCCCTGG TGCCCGCGCC CGCGCCGGTG GCATCGATCG CGACGACGGG GGCCGCCCCC
CAGAGCGCCG ACGCCGACCT GGCCGGCCAG GCCCTCGCCG GCCAGGCCCT CGGCGGGGAC
GATCTGCGCG AGGTCACGGA CGAGTTCGCG GACGGCCTCT TTTACAGCCG GGACTGGTTG
CGCAGGTGCG TCGAGCTTCT GCGGGACCGG CCGCAGCTCA TCTTCTACGG CCCGCCCGGC
ACCGGCAAGA CGTACCTCGC CCGGCGCTTG GCCTGGCATC TCGCCGACGA TCGGCGCGAG
AACGTCACCC TCGTCCAGTT CCATCCGTCC TATATGTACG AGGACTTCTT CCAGGGTTTC
CGTCCGGTGC AGGTGAAGGG CGGCGACGGT GACGCGGCCA CCACCAACCG GATGTCCTTC
GAGCTCGTGG ACGGTCCGCT GCGCCGGCTC GCCACCGCCG CCGAGCTCAA TCCCCGCCAG
GCGTTCTTCC TGATCATCGA TGAGATCAAC CGGGGTGACC TGGCCAGGAT TTTCGGTGAG
CTGTACTTTC TGCTGGAATA CCGCGGGGAG GCCGTCACCA CCCAGTACGC CTCCGCGGAC
ACGCGCGACT TCCAACTACC GAAGAACCTG TTCATCATCG GCACGATGAA CAGCGCGGAC
CGTTCGATCG CCGCCTTCGA CCAGGCGCTG CGCCGGCGGT TCACCTTCGT CGGGCTCCAT
CCGGACGTCG AACCGACCGC GTCCGTGCTG CGCCGCTGGC TTGCGGCAGG CGACCTGCCC
GACGAGGCGG CCCGCCTGCT CGGCGAGCTG AACCGCCGCA TCGACGACCC GGACGCCCGC
ATCGGCCCGT CCTATCTGAT GCGCGACCGC GTCCACCAGA GTGCCGACGG CCTCGACCGG
GTGTGGGAAC ATCAGATCCT GCCGCTGCTG GCCGAACACC ACGTCGGCGA GACCATCGAT
CTCGCGGCCC GGTACGGCCT ACCGGCGCTG CGGCAGTCCC TCGGCCTGGC CGCGGTGGTT
CCGGCCGCGG TGGTTCCGGC CGCCGGAGTC CCCGCGAACA CCATCCCCGG CCCCGCGGCA
GGCTGA
 
Protein sequence
MADQLEIIGG LLYRVMRILA EVGEASPDEL WMRMRASDAG ADPGWRRGSG DDPRSAVARK 
LVLRGAVYLA RAGHLSESNR RWQATGIGRD ALRASPDEAA WWRDVTEHNT YWREHRSSLG
LVDDVLAVLP EQTWVSVTDL STVADLTVDG LVRHLCGFRP EGWSRVLDPA GRPPREALFT
EDEYRDWLDW FEVEVGDLTA GRAAGELRLP LDDLRMLVES IAPERIPRRA WLVRGADRRG
RSLARDLWLA DEVCSLPGDR LEVRPGVDRE QVRRAVDREH ASLTSARRGR LTSEYHAFLS
RMRENDLVVV NDRDEYYVGE ILGPPVFVAS VGGVADLQRP VHWRNADEPV NYLDLPDRVA
ALLGNAEARI VDLSDFVADL DALVPAPAPV ASIATTGAAP QSADADLAGQ ALAGQALGGD
DLREVTDEFA DGLFYSRDWL RRCVELLRDR PQLIFYGPPG TGKTYLARRL AWHLADDRRE
NVTLVQFHPS YMYEDFFQGF RPVQVKGGDG DAATTNRMSF ELVDGPLRRL ATAAELNPRQ
AFFLIIDEIN RGDLARIFGE LYFLLEYRGE AVTTQYASAD TRDFQLPKNL FIIGTMNSAD
RSIAAFDQAL RRRFTFVGLH PDVEPTASVL RRWLAAGDLP DEAARLLGEL NRRIDDPDAR
IGPSYLMRDR VHQSADGLDR VWEHQILPLL AEHHVGETID LAARYGLPAL RQSLGLAAVV
PAAVVPAAGV PANTIPGPAA G